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PHARMACOPHORE RECOMBINATION FOR THE IDENTIFICATION OF 
] 0 SMALL MOLECULE DRUG LEAD COMPOUNDS 

FIELD OF THE INVENTION 

The present invention is directed to novel methods for identifying small 
molecule drug lead compounds. 

15 This invention was made with Government support under Contract No. R01 

GM50353 awarded by the National Institutes of Health. The Government has certain 
rights to this invention. 

BACKGROUND OF THE INVENTION 
In response to the ever increasing demand for novel compounds useful in the 

-° effective treatment of various maladies, the medical research community has 
developed a number of different strategies for discovering and optimizing new 
therapeutic drugs. For the most part, these strategies are dependent upon molecular 
techniques that allow the identification of tightly binding ligands for a given biological 
target molecule. Once identified, these ligands may then carry out their therapeutic 

5 functions by activating, inhibiting or otherwise altering the activity of the molecular 
target to which they bind. 

In one such strategy, new therapeutic drugs are identified by screening 
combinatorial libraries of synthetic small molecule compounds, determining which 
compound(s) have the highest probability of providing an effective therapeutic and 

0 then optimizing the therapeutic properties of the identified small molecule 
compound(s) by synthesizing structurally related analogs and analyzing them for 
binding to the target molecule (Gallop et al., J. Med. Chem. 37:1233-1251 (1994), 
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Gordon et al., J. Med. Chem. 37:1385-1401 (1994), Czarnik and Ellman, Acc. Chem. 
Res. 29:112-170 (1996), Thompson and Ellman, Chem. Rev. 96:555-600 (1996) and 
Balkenhohl et al., Angew. Chem. Int. Ed. 35:2288-2337 (1996)). However, this 
process is not only time consuming and costly, it often does not provide for the 
5 successful identification of a small molecule compound having sufficient therapeutic 
potency for the desired application. For example, while the preparation and evaluation 
of combinatorial libraries of small molecules has proven somewhat useful for new 
drug discovery, the identification of small molecules for difficult molecular targets 
(e.g., such as those useful for blocking or otherwise taking part in protein-protein 
10 interactions) has not been particularly effective (Brown, Molecular Diversity 2:217- 
222 (1996)). 

One issue that limits the success of combinatorial library approaches is that it is 
possible to synthesize only a very small fraction of the possible number of small 
molecules. For example, greater than 10 60 different small molecules having valid 
15 chemical structures and molecular weights under 600 daltons can be envisioned. 
However, even the most ambitious of small molecule combinatorial library efforts 
have been able to generate libraries of only tens to hundreds of millions of different 
compounds for testing. Therefore, combinatorial technology allows one to test only a 
very small subset of the possible small molecules, thereby resulting in a high 

20 probability that the most potent small molecule compounds will be missed. Thus, 
suitable small molecule compounds having the required availability, activity or 
chemical and/or structural properties often cannot be found. Moreover, even when 
such small molecule compounds are available, optimization of those compounds to 
identify an effective therapeutic often requires the synthesis of an extremely large 

25 number of structural analogs and/or prior knowledge of the structure of the molecular 
target for that compound. Furthermore, screening large combinatorial libraries of 
potential binding compounds to identify a lead compound for optimization can be 
difficult and time-consuming because each and every member of the library must be 
tested. It is evident, therefore, that novel methods for rapidly and efficiently 

30 identifying new small molecule drug leads are needed. 

Living organisms evolve through a process that includes both (1) genetic 
recombination, where sexual reproduction acts to mix and recombine the attributes of 
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the parent organisms to provide progeny having attributes of both parents, and (2) 
natural selection, where only those progeny that are sufficiently "fit" are capable of 
passing their attributes on to the next generation. Approaches that closely model the 
process by which organism evolve have previously been reported for identifying small 
5 molecules that bind to receptors and enzymes (Weber et al., Angew. Client. Int. Ed. 
Engl. 34:2280-2282 (1995) and Singh et al., J. Am. Chem Soc. 1 18:1669-1676 (1996)). 
These approaches are based upon the mathematical method termed "genetic 
algorithms" (Holland, Sci. Am. 66-72 (1992)). Using genetic algorithms, a population 
of different compounds is screened to identify the compounds that bind to the receptor 
1 0 or enzyme (i.e., the "fittest" compounds). A population of progeny compounds is then 
prepared by recombining the building blocks that were used to prepare the "fittest" 
compounds. A screen is then performed to identify the compounds that bind to the 
target with the highest affinity, which are made up of the optimal building block 
combinations. 

15 However, because the building blocks employed in the genetic algorithm 

approach are not preselected, one of two techniques are used to identify tight binding 
ligands: (1) extremely large populations of compounds must be screened and 
recombined, or (2) multiple rounds of screening and recombination are performed on 
relatively small populations where additional building blocks are gradually introduced 

20 through a process that is analogous to genetic mutation. In this second approach, 
many rounds of selection, recombination and building block introduction are required 
to identify the optimal building block combinations in analogy to the many rounds of 
selection, reproduction and mutation that are required in the evolution of living 
organisms. Thus, the use of genetic algorithms is currently limited because of the 

25 large amount of time required for compound preparation and screening, wherein the 
goal of new drug discovery is to identify a potent compound as quickly as possible. 

Another recently reported approach for identifying high affinity ligands for 
molecular targets of interest is by determining structure-activity relationships from 
nuclear magnetic resonance analysis, i.e., "SAR by NMR" (Shuker et al., Science 

™ 274:1531-1534 (1996) and U.S. Patent No. 5,698,401 by Fesik et al.). In this 
approach, the physical structure of a target protein is determined by NMR and then 
small molecule building blocks are identified that bind to the protein at nearby points 
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on the protein surface. Adjacently binding small molecules are then coupled together 
with a linker in order to obtain compounds that bind to the target protein with higher 
affinity than the unlinked compounds alone. Thus, by having available the NMR 
structure of the target protein, the lengths of linkers for coupling two adjacently 

5 binding small molecules can be determined and small molecule ligands can be 
rationally designed. This approach has been useful for identifying compounds that 
bind to FK506 binding protein with a K d = 20 nM (Shuker et al., supra) and to 
stromelysin with a K 0 = 15 nM (Hajduk et al., J. Am, Chem. Soc, 119:5818-5827 
(1997) and Hajduk et al., J. Am, Chem, Soc, 119:5828-5832 (1997)). 

3 However, while theJSAR by NMR method is powerful, it also has serious 

limitations. For example, the approach requires huge amounts of target protein (>200 
mg) and this protein typically must be l5 N-labeIed so that it is useful for NMR studies. 
Moreover, the SAR by NMR approach usually requires that the target protein be 
soluble to >0.3 mM and have a molecular weight less than about 25-30 kDa. 
Additionally, the structure of the target protein is first resolved by NMR, a process 
which often can require a 6 to 12 month time commitment. 

From the above, it is evident that there is a need for novel techniques useful for 
rapidly and efficiently identifying small molecule drug lead compounds that are 
capable of binding with high affinity to a molecular target of interest. We herein 
describe for the first time a method which is based upon pharmacophore 
recombination, wherein a population of small molecule pharmacophores are "pre- 
selected" for the ability to bind to a molecular target and wherein the small molecule 
pharmacophores that bind with the highest affinity are then chemically linked in 
various combinations to provide a library of potential high affinity binding ligands. 
The library of potential binding ligands is then screened using a simple functional 
assay for the presence of one or more compounds that bind to the target molecule with 
very high affinity. 

SUMMARY OF THE INVENTION 

Applicants herein describe a molecular approach for rapidly and efficiently 
identifying small molecule ligands that are capable of binding to a target biological 
molecule with high affinity, wherein ligand compounds identified by the method are 
useful as new small molecule drug lead compounds. The herein described methods 
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allow a library of only the most favorable compounds to be assayed for binding to a 
target biological molecule without the need for screening all possible small molecule 
compounds and combinations thereof for binding to the target as is required in 
standard combinatorial library approaches. More specifically, a library of candidate 
5 target binding fragments is assembled and subjected to a first screen or "pre-screened" 
to identify a subset of that library that bind to a target biological molecule with or 
below a certain dissociation constant. Those candidate target binding fragments 
identified during this "pre-screening" step as being capable of binding to the target 
biological molecule are then coupled or cross-linked in a variety of combinations 
10 using one or more linker elements to provide a library of potential high affinity 
binding ligands or candidate cross-linked target binding fragments, whose building 
blocks represent the small candidate target binding fragments having the highest 
affinity for the target biological molecule as identified in the "pre-screening" step. 
The library of potential ligands or candidate cross-linked target binding fragments for 

1 5 binding to the target biological molecule is then screened a second time to identify 
those members that exhibit the lowest dissociation constant . for binding to the target 
biological molecule. Because the library of candidate target binding fragment 
building blocks is initially "pre-screened" to select for a much smaller set of the most 
favorable building blocks, the most productive building block and cross-linker 

20 combinations can be identified without the laborious task of screening all possible 
combinations of all building blocks coupled together by a set of linkers. The process 
of identifying high affinity drug lead compounds is therefore, greatly expedited. 

With regard to the above, one embodiment of the present invention is directed 
to a method for identifying drug lead compounds that bind to a biological target 

25 molecule of interest, wherein the method comprises the steps of: 

(a) Assembling a library of candidate target binding fragments (CTBF) 
capable of being chemically cross-linked by a cross-linker element to provide 
candidate cross-linked target binding fragments for binding to the target biological 
molecule; 

3° (b) screening the library of candidate target binding fragments to identify at 

least first and second candidate target binding fragments which bind to the target 
biological molecule; 



- 5 - 



« '» 

WO 99/49314 



PCT/US99/06734 



(c) chemically cross-linking the at least first and second candidate target 
binding fragments or structurally related analogs thereof with a cross-linker element to 
provide a library of candidate cross-linked target binding fragments for binding to the 
target biological molecule; and 
5 (d) screening the library obtained in (c) to identify a drug lead compound 

that binds to the target biological molecule. 

In various preferred embodiments, the library of candidate target binding 
fragments may comprise compounds of less than 500 daltons, may comprise simple 
aldehydes, amines, amides, carbamates, ureidos, sulfonamides, alcohols, carboxylic 
10 acids, thiols, aryl halides, alkenes, alkynes, ketones, ethers and/or oximes and/or may 
bind to the target biological molecule with a K„ of 10 mM or lower. In a particularly 
preferred embodiment, the library or candidate target binding fragments may comprise 
oxime compounds, wherein the structurally related aldehyde analogs of those oxime 
compounds are capable of being chemically cross-linked via an O.O'-diamino- 
15 alkanediol cross-linker. Target biological molecules that find use in the described 
methods include, for example, proteins, nucleic acids and saccharides, preferably 
proteins. Preferred TBM's include human or human pathogen proteins, especially 
enzymes, human hormones, human receptors and fragments thereof. These TBM's 
may all contain atoms of naturally occuring isotopic abundance. 
20 In other preferred embodiments, the library of candidate cross-linked target 

binding fragments comprises candidate cross-linked target binding fragments of less 
than about 1000 daltons, that may be homo- or heterodimeric having a K„ for the TBM 
of from about 500 uM to about 500 nM or lower. 

Another embodiment of the present invention is directed to a method for 
25 inhibiting the interaction between first and second biological molecules, wherein the 
method comprises the step of contacting a system comprising both the first and second 
biological molecules with a binding inhibitory amount of a candidate cross-linked 
target binding fragment identified by the above described method, wherein the 
candidate cross-linked target binding fragment binds to one of the first or second 
W biological molecules and inhibits their ability to interact. 



- 6 - 



WO 99/49314 



PCT/US99/06734 



A further embodiment of the present invention is directed to a drug lead 
compound made by the the method described herein, where the compound is 
represented by the formula: 

] — LGt- | XL I — LG s-{ 



TBF„ 



TBF„ 



and 



10 



15 



20 



25 



.10 



TBF m -part A 



TBF n -part C 



LGr-fxr]— LG 4 



TBF m -part B 



TBF n -part D 



where 

TBF m represents a first TBF selected from step (d); 
TBF n represents a second TBF selected from step (d); 

TBF m -part A and B represent TBF m from step (d) where each fragment is 
bonded to a single atom in LG 3 ; 

TBF n -part C and D represent TBF n from step (d) where each fragment is 
bonded to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
-(C^C.-alkyl-L'-L'-L'-L'-L^C^Q-alkyl)-; 

LG, and LG 2 are linking groups independently selected from the group - 
C(R>N-0-, -0-N=C(R>, -CHj-N(R a )-, -NCRJ-CH,-, -C(=0)-N(R a K -N(R>C(=0)-, 
-N(R>C(=0)-0-, -0-C(=0)-N(R>, -N(R>C(=0)-N(R b )-, -N(R>C(=0)-N(R b )-, - 
SO r N(RJ- and -N(R a )-S0 2 -; 

LG 3 and LG 4 are linking groups independently selected from the group >C=N- 
O-, -0-N=C<, -CH,-!**, >N-CH 2 -, -C(=0)-N<, >N-C(=0)-, >N-C(=0)-0-, -0-C(=0)- 
N<, >N-C(=0)-N(R h K -N(R a )-C(=0)-N<, -S0 2 -N< and >N-S0 2 -, where < and > 
represent two bonds linking CTBF- part A, B, C, or D to the single N or C atom in 
LGj or LG 4 ; 

R a and R h are independently selected from the group 
hydrogen, C r C (0 -aikyl, C 0 -C l0 -alkyl-C 6 -C l0 -aryl, C 6 -C l0 -aryl-C 0 -C l0 -alkyl, C 0 -C I0 -alkyi- 
heterocycle-C 0 -C l0 -alkyl, C r C 6 -alkyl-NH-C r C 6 -alkyl, C 0 -C l0 -alkyl-O-C 0 -C l0 -aIkyl, C 0 - 
C I0 alkyl-C(=O)-C w -C 10 .alkyI, C 0 -C l0 -a!kyl-NH-C(=O)-C 0 -C l0 -alkyl 1 C 0 -C l0 -alkyl-O- 
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C(=O)-C o -C l0 -aikyl, where any alkyl, aryl or heterocycle is optionally substituted 
with C r C I0 -attcyl, C r C l0 -alkoxy, C 6 -C ltr aryl, C 6 -C t0 -aryloxy, halo (F, CI, Br, I), 
hydroxy, carboxy, amino, nitro and 8(0)0.3; 

TBF m , TBF n , TBF m -part A, TBF m -part B, TBF n -part C and TBF n -part D are 
5 each independently represented by formula I 

-A-(Cycle l)-B-(Cycle 2)-E (I) 

Where 

Cycle 1 and Cycle 2 are independently present or absent and are selected from 
'° a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 
6 or 7 atoms in the ring where the ring atoms are carbon, or from 1-4 heteroatoms 
selected from; nitrogen, oxygen, and sulfur, and where any sulfur ring atom may 
optionally be oxidized and any carbon ring atom may form a double bond with O, 



15 



25 



10 



nil' n 

NR and CR R , each ring nitrogen may be substituted with R and any ring carbon 
may be substituted with R^; 



A and B are independently selected from 





L 3 - — L 2 


L 1 








20 


L 4 — L 3 — 


— L 2 — L 1 - 






L 5 — L 4 


L 3 L 2 



and 



where: 



L 1 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR l R r ), C(R 1 R 1 '), CXR 1 ), C, het, N(R n ) or N; 

L 2 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 2 R 2 '), C(R 2 R 2 ), C(R 2 ), C, het, N(R n ) or N; 

L 3 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 3 R 3 '), C(R 3 R 3 ') C(R 3 ), C, het, N(R n ) or N; 
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L is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 4 R 4 '), C(R 4 R 4 '), C(R 4 ), C, NR n or N; and 

L 5 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(R 5 R 5 '), C(=CR 5 R 5 \ C(R 5 ), C, NR n or N; 

R 1 , R 1 ', R 2 , R 2 ', R 3 , R 3 \ R 4 , R 4 ', R 5 and R 5 ' each are independently 
selected from R a , R a \ R c and U-Q-V-W; where s is 0-2 

Optionally, each R*-R 5 or NR n together with any other R^R 5 or NR n may 

10 

form a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being 
a homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring 
containing 1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur 

atom may optionally be oxidized, each ring nitrogen optionally substituted with R n 

1 5 and each ring carbon optionally substituted with R d ; 

E is -L*-L 2 -L 3 -R a ; 

R a is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
C ][ -C 11 alkyl, halo(F, CI, Br, IK^-Cj ^lkoxy, hydroxy-Cj-Cj ^lkyl, cyano, 

20 isocyanate, carboxy-Cp-C^alkyl, amino, Cq-Cj jalkyl-amino-CCj-Cgalkyl), C Q - 

Cj jalkyl-amino-di-CCj-Cgalkyl), aminocarbonyl, ^i" C i jalkylcarbonylamino, 

carboxamido, carbamoyl, carbamoyloxy, formyl, formyloxy, azido, nitro, hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, Cj-C^alkoxy, 

25 mercapto, sulfonamido, het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyi and indolyl; 

R is selected from the group of C 0 -C 10 alkyl-Q-C Q -C 6 alkyl, Cg-C^alkenyl- 

Q-C 0 ~C 6 alkyl, C 0 -C 1() alkynyl-Q-C 0 -C 6 alkyl, C 3" C j jCycIoalkyl-Q-Cg-Cgalkyl, C 3 - 

30 C 1() cycloalkenyl-Q-C 0 -C 6 alkyl, C r C 6 aIkyl-C 6 -C 12 aryI-Q-C 0 -C 6 alkyl, C^C^aryl- 

Cj-Cgalkyl-Q-CQ-Cgalkyl, C 0 -C 6 a!kyl-het-Q-C 0 -C 6 alkyl, C 0 -C 6 alkyl-Q-het-C 0 - 
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C 6 alky, whererany aryl or het is optionally substituted with 1-3 R d and any alkyl, 



alkenyl or alkynyl is optionally substituted with 1-3 R a ; 
a a' 

5 R and R may join to form a 3-7 member homocyclic ring substituted with 1- 

3R a ; 

R c is selected from hydrogen and substituted or unsubstituted; amino, O-C^- 
C g alkyl, amino-CC^Cgalkyl), amino-dHC^Cgalkyl), C^C^alkyl, C 2 -C 1() alkenyl, 
10 C 2 -C 1Q alkynyl, C^CjjCyclpalkyl, C 3 -C 1() cycloalkenyl, C^Cgalkyl-Cg-C^aryl, 
Cg-C^aryl-Cj-Cgalkyl, Cj-Cgalkyl-het, het-Cj -Chalky 1, Cg-C^aryl and het, where 

the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any 
aryl or het are 1-3 R d ; 



1 5 R d is selected from R h and R p ; 



R h is selected from the group OH, OCF 3 , OR c , SR m , halo(F, CI. Br, I), CN, 

isocyanate, N0 2> CF y C 0 -C 6 alkyl-NR n R n \ C 0 -C 6 alkyl-C(=O)-NR n R n \ C Q - 

20 C 6 alkyl-C(=0)-R a , C^Cgalkyl, Cj-Cgalkoxy, C 2 -C g alkenyl, C 2 -C g alkynyl, C 3 - 
CgCycloalkyl, C 3 -C 6 cycloalkenyl, C^Cgalkyl-phenyl, phenyl-Cj-C^lkyl, 
Cgalkyloxycarbonylamino, C^Cgalkyloxycarbonyl- C Q -C 6 alkyl, phenyl-C Q - 
C 6 alkyloxy, Cj-^alkyl-het, het-C^C^lkyl, SO^het, O-Cg-C^aryl, SO^Cg- 

25 

C^aryl, SO^C^-Cgalkyl and het, where any alkyl, alkenyl or alkynyl may optionally 

be sustituted with 1-3 groups selected from OH, halo(F, CI, Br, I), nitro, amino and 
aminocarbonyl, where the substituents on any aryl or het are 1-2 hydroxy, halo(F, CI, 
Br, I), CF 3 , Cj-Cgalkyl, C^-Cgalkoxy, nitro and amino; 

30 R m is selected from hydrogen, S-C^Cgalkyl, C^OK^-Cgalkyl, C(=0)- 

NR n R n ', Cj-C 6 alkyl, ha!o(F, CI, Br, I)-C 1 -C (; alkyl, benzyl and phenyl; 
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R n is selected from the group R c , OH, OCF 3> OR°, CN, isocyanate, NH- 

C(=0)-0-R c , NH-C(=0)-R c , NH-C(=0)-NHR c , NH-S0 2 -R s , NH-S0 2 -NH-C(=0)- 

R c , NH-C(=0)-NH-S0 2 -R s , C(=0)-0-R°, C(=0)-R c , C(=0)-NHR c , C(=0)-NH- 

5 C(=0)-0-R°, C(=0)-NH-C(=0)-R c , C(=0)-NH-S0 2 -R s , C(=0)-NH-S0 2 -NHR c , 

S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2 , S0 2 -NH-C(=0)-0-R°, S0 2 -NH-C(=0)-0-R° and 

S0 2 -NH-C(=0)-R c ; 

10 R° is selected from hydrogen and substituted or unsubstituted Cj-C^alkyl, C Q - 

C 6 alkyl-C 6 -C 1() aryl, Cj-Cgalkylcarbonyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 - 

CgCycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

substituents on any aryl are 1-3 R^; 

R p is selected from the group; OH, COOH, COH, NH 2 , C 0 -C 6 alkyl, halo(F, 

CI. Br, I), CN, isocyanate, OR°, SR m , SOR°, NO^ CF 3> R c , NR n R n ', N(R n )-C(=0)- 

O-R 0 , N(R n )-C(=0)-R c , S0 2 -R s , C 0 -C 6 alkyl-SO 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', 

C(=0)-R c , 0-C(=0)-R c , C(=0)-0-R° and C(=0)-NR n R n ', where the substituits on 
any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or het are 1-3 
R d ; 

„ R s is a substituted or unsubstituted group selected from; C.-CLalkyl, C 0 - 

C g alkenyl, C 2 -C g alkynyl, C 3 -C g cycIoalkyl, C 3 -C 6 cycloalkenyl, C 0 -C 6 alkyl- Cg- 
C 1() aryl, C 6 -C 1() aryl-C 0 -C 6 alkyl, C Q -C 6 alkyl-het and het-C Q -C 6 alkyl, where the 

substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl 

10 or het are 1-3 R d ; 

het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-membered ring containing from one to four 
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heteroatoms selected from the group nitrogen, oxygen, and sulfur, the 5-membered 
ring having from 0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 
3 double bonds and where any carbon or sulfur atoms in the ring may optionally be 
oxidized, and where any nitrogen heteroatom may optionally be quaternized and where 

any ring may contain from 0-3 R d ; 

U is an optionally substituted bivalent radical selected from the group; C^- 

Cgalkyl, C 0 -C 6 alkyl-Q, C 2 -C 6 alkenyl-Q, and C^-C^alkynyl-Q, where the substituits 

on any alkyl, alkenyl or alkynyl are 1-3 R a ; 

Q is absent or is selected from the group; -O-, -S(O) -, -S0 0 -N(R n )-, -N(R n )-, 

s 

-N(R n )-C(=0)-, -N(R n )-C(=0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C(=0)-0-, -het-,-C(=0)- 

N(R n )-, -PO(OR c )0- and -P(0)0-, where s is 0-2 and the heterocyclic ring is 
h 

substituted with 0-3 R ; 

V is absent or is an optionally substituted bivalent group selected from Cj- 

C 6 alkyl, C 3 -C g cycloalkyl, C 0 -C 6 alkyl-C 6 -C 1() aryl, and Cg-Cgalky-het, where the 

a d 
substituits on any alkyl are 1-3 R and the substituents on any aryl or het are 1-3 R ; 

W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n ', -NH-C(=0)- 

0-R°, -NH-C(=0)-NR n R n ', -NH-C(=0)-R c , -NH-S0 2 -R s , -NH-SC» 2 -NR n R n ' , -NH- 

S0 2 -NH-C(=0)-R c , -NH-C(=0)-NH-S0 2 -R s , -C(=0)-NH-C(=0)-0-R°, -C(=0)-NH- 

C(=0)-R c , -C(=0)-NH-C(=0)-NR n R n ', -C(=0)-NH-S0 2 -R s , -C(=0)-NH-S0 2 - 

NR n R n ', -C(=S)-NR n R n \ -S0 2 -R S , -SO 2 -0-R 0 , -S0 2 -NR n R n ', -S0 2 -NH-C(=0)-0- 

R°, -S0 2 -NH-C(=0)-NR n R n ', -S0 2 -NH-C(=0)-R c , -0-C(=0)-NR n R n ', -0-C(=0)- 

R c , -0-C(=0)-NH-C(=0)-R c , -0-C(=0)-NH-S0 2 -R s and -0-S0 2 -R s ; 

Optionally, TBF m -part A together with TBF„-part B and TBF„-part C together 
with TBF„-part D may independently form Cycle 1 substituted with -B-(Cycle 2)-E. 
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A drug lead precursor or intermediate of this invention is represented by C 0 -C 2 - 

alkyl-L^L^L^-L^L^Co-Cj-alkyl where L 1 through L 5 are defined above. Additional 
embodiments of the present invention will become evident to the ordinarily skilled 
artisan upon a review of the present specification. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a synthetic reaction wherein an aldehyde is reacted with O- 
methyl hydroxylamine to produce an O-methyl oxime compound. 

Figure 2 shows a variety of organic aldehyde pharmacophore molecules which 
after conversion to )-methyl oximes were identified as being capable of inhibiting the 
interaction between CD4 and gpl20. 

Figure 3 shows chemistry useful for chemically coupling two aldehydes via an 

0. 0'-diamino-alkanediol linker to produce both heterodimeric and homodimeric 
oxime compounds. 

Figure 4 shows a variety of aldehyde pharmacophores found to be highly 
efficient in dimeric form for inhibiting the interaction between CD4 and gpl20. 

Figure 5 shows an O-methyl oxime compound found to be particularly 
effective in dimeric form for inhibiting the interaction between CD4 and gpl20. 

Figure 6 shows a specific ligand having particularly high activity for inhibiting 
the interaction between CD4 and gpl20. 

Figure 7 shows a synthetic reaction wherein an aldehyde is reacted with a 
dimethylamine in the presence of support-bound triacetoxyborohydride to give rise to 
an N,N-dimethylamine organic compound. 

Figure 8 shows a chemical synthesis sequence resulting in the production of a 
ligand of the present invention. 

Figure 9 shows a flow diagram for the fragment assembly sequence resulting in 
the production of a drug lead compound of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

1. General Description 
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The present invention provides a rapid and efficient method for identifying 
small molecule candidate target binding fragments or ligands that are capable of 
binding with high affinity to target biological molecules (TBM) of interest. The 
compounds identified by the subject method find use, for example, as drug lead 
compounds for the development of novel therapeutic drugs. The subject method 
involves the assembly of a library of small organic candidate target binding fragments 
which are capable of being chemically cross-linked via a linker element. The library 
of organic candidate target binding fragments may be "pre-screened" in order to 
identify members of the library that are capable of binding to a target biological 
molecule. At least a portion pf the small organic candidate target binding fragments 
identified during the "pre-screen" as being capable of binding to the target are then 
chemically cross-linked to one another in various combinations to provide a library of 
potential or candidate cross-linked target binding fragments for binding to the target 
biological molecule. The library of candidate cross-linked target binding fragments 
having potential to be high affinity antagonists (or agonists) are then screened to 
identify drug lead compounds that bind to the target biological molecule with high 
affinity. The step of "pre-screening" the library of small organic candidate target 
binding fragments to identify those that are capable of binding to the target allows one 
to limit the library of potential binding ligands to only those that are comprised of the 
most favorable organic candidate target binding fragment building blocks, thereby 
decreasing the required complexity of the library of potential ligands while increasing 
the probability of identifying a molecule exhibiting high binding affinity for the 
biological target molecule. 

One embodiment of the present invention is directed to a method for 
identifying a drug lead compound that binds to a target biological molecule of interest. 
The subject method involves assembling a library of organic candidate target binding 
fragment(s) that are capable of being chemically cross-linked via a chemically 
compatible cross-linker to provide candidate cross-linked target binding fragment(s) 
for binding to the target biological molecule. In this regard, the phrase "assembling a 
library of organic candidate target binding fragments" is to be construed broadly and is 
intended to encompass all means by which one may obtain a library comprising two 
or more organic compounds which include, for example, obtaining such compounds 



- 14 - 



WO 99/49314 



PCT/US99/06734 



from a commercial or non-commercial source, synthesizing such compounds using 
standard chemical synthesis technology or combinatorial synthesis technology (see 
Gallop et al. (1994), supra, Gordon et al. (1994), supra, Czarnik and Ellman (1996), 
supra, Thompson and Ellman (1996), supra and Balkenhohl et al. (1996) supra) and 
5 obtaining such compounds as degradation products from larger precursor compounds, 
e.g. known therapeutic drugs, large chemical molecules, and the like. 

The candidate target binding fragments (CTBF) and are, for the most part, 
small water soluble organic molecules that have one or more chemically reactive 
functionalities, referred to as linkable (or linkage) functional groups (LFG) (or sites 
10 that may be readily converted to a chemically reactive functionality using standard 
technology (BLFG)) that provide a site for coupling to another compound or candidate 
target binding fragment via a chemically compatible cross-linker. Thus, the candidate 
target binding fragments of the present invention are capable of being chemically 
coupled to one another via a cross-linker to provide candidate cross-linked target 
15 binding fragments for binding to the target biological molecule, meaning that the 
candidate target binding fragment compounds have a reactive functionality, or a site 
that can be readily chemically converted to a reactive functionality, where a 
chemically compatible cross-linker may covalently attach thereto, thereby allowing 
multimerization of the candidate target binding fragments through the cross-linker. 
. 20 "Ligands", "candidate ligands' or "candidate cross-linked target binding fragments " 
for binding to a "target biological molecule" for purposes herein are compounds that 
are obtained from reacting two or more organic compounds, which may be the same or 
different, preferably different, with one or more cross-linker so as to produce a 
molecule comprising two or more target binding fragments and one or more cross- 

25 linker. Such ligands are referred to herein as candidate cross-linked target binding 
fragments(CXL-TBF). 

Candidate target binding fragments having the linkage functional group 
modified or blocked so that it contains substantially the same linking group as is found 
in the candidate cross-linked binding fragments are . sometimes referred to as 

30 monomers. Monomers or monomelic compounds that find use in the present invention 
include, for example, aldehydes, ketones, oximes, such as O-alkyl oximes, preferably 
O-methyl oximes, hydrazones, semicarbazones, carbazides, primary amines, secondary 
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amines, such as N-methylamines, tertiary amines, such as N,N-dimethylamines, N- 
substituted hydrazines, hydrazides, alcohols, ethers, thiols, thioethers, thioesters, 
disulfides, carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, 
thioketals, acetals, thioacetals, aryl halides, aryl sulfonates, alkyl halides, alkyl 
5 sulfonates, aromatic compounds, heterocyclic compounds, anilines, alkenes, alkynes, 
diols, amino alcohols, oxazolidines, oxazolines, thiazolidines, thiazolines, enamines, 
sulfonamides, epoxides, and aziridines, and the like, all of which have chemically 
reactive functionalities (or are directly prepared from precursor compounds that have 
chemically reactive functionalities) capable of linking, either directly or indirectly, to a 
10 cross-linker . In fact, virtuaUy any small organic molecule that is capable of being 
chemically coupled to another small organic molecule may find use in the present 
invention with the proviso that it is sufficiently soluble in aqueous solutions to be 
tested for its ability to bind to a target biological molecule. 

The above described monomers or candidate target binding fragments will 
15 serve as the individual building blocks for candidate cross-linked binding fragments 
prepared therefrom. Candidate target binding fragments that find use herein will 
generally be less than about 2000 daltons in size, usually less than about 1500 daltons 
in size, more usually less than about 750 daltons in size, preferably less than about 500 
daltons in size, often less than about 250 daltons in size and more often less than about 

20 200 daltons in size, although organic molecules larger than 2000 daltons in size will 
also find use herein. Candidate target binding fragments that find use may be 
employed in the herein described method as originally obtained from a commercial or 
non-commercial source (for example, a large number of small organic chemical 
compounds that serve as candidate target binding fragments are readily obtainable 

25 from commercial suppliers such as Aldrich Chemical Co., Milwaukee, WI and Sigma 
Chemical Co., St. Louis, MO) or may be obtained by chemical synthesis. Examples of 
the latter include the preparation of a library of organic oxime compounds from a 
single step condensation of commercially available aldehydes with O-alkyI 
hydroxylamine as described in Example I below and the preparation of a library of 

^° N,N-dimethyIamine candidate cross-linked target binding fragments from the 
reductive amination of commercially available aldehydes and dimethylamine using 
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support-bound triacetoxyborohydride as described in Example II below and by Kaldor 
et ah, TetraJiedron Lett. 37:7193-7196 (1996). 

Libraries of candidate target binding fragments or candidate cross-linked 
target binding fragments which find use herein will generally comprise at least 2 
5 organic compounds, often at least about 25 different organic compounds, more often 
at least about 100 different organic compounds, usually at least about 500 different 
organic compounds, more usually at least about 1000 different organic compounds, 
preferably at least about 2500 different organic compounds, more preferably at least 
about 5000 different organic compounds and most preferably at least about 10,000 or 
10 more different organic compounds. Libraries may be selected or constructed such that 
each individual molecule of the library may be spatially separated from the other 
molecules of the library (e.g., each member of the library is present in a separate 
microtiter well) or two or more members of the library may be combined if methods 
for deconvolution are readily available. The methods by which the library of organic 

1 5 compounds are prepared will not be critical to the invention. 

Once assembled, a library of organic candidate target binding fragments will be 
screened using one of any number of different known assays for the purpose of 
identifying candidate cross-linked target binding fragments that are capable of binding 
to a target biolocical molecule of interest. "Biological target molecules", "target 

20 biological molecules", "target biomolecules", "molecular targets", "biological targets", 
and other grammatical equivalents refer to target biological molecules (TBM) that are 
available (either commercially, recombinantly, synthetically or otherwise) in sufficient 
quantities for use in in vitro binding assays and for which there is some interest for 
identifying a high affinity binding partner. For the most part, target biological 

25 molecules are proteins, including human proteins or human pathogen proteins that 
may be associated with a human disease condition, such as cell surface and soluble 
receptor proteins, such as lymphocyte cell surface receptors, enzymes, such as 
proteases, clotting factors, serine/threonine kinases and dephosphorylases, tyrosine 
kinases and dephosphorylases, bacterial enzymes, fungal enzymes and viral enzymes, 

30 signal transduction molecules, transcription factors, proteins associated with DNA 
and/or RNA synthesis or degradation, immunoglobulins, hormones, receptors for 
various cytokines including, for example, erythropoietin/EPO, granulocyte colony 



- 17 - 



WO 99/49314 



PCT/US99/06734 



stimulating receptor, granulocyte macrophage colony stimulating receptor, 
thrombopoietiYT(TPO), IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-10, IL-11, IL-12, growth 
hormone, prolactin, human placental lactogen (LPL), CNTF, octostatin, various 
chemokines and their receptors such as RANTES, MIPl-oc, IL-8, various ligands and 
receptors for tyrosine kinases such as insulin, insulin-like growth factor 1 (IGF-1), 
epidermal growth factor (EGF), heregulin-cc and heregulin-0, vascular endothelial 
growth factor (VEGF), placental growth factor (PLGF), tissue growth factors (TGF-cc 
and TGF-P), other hormones and receptors such as bone morphogenic factors, folical 
stimulating hormone (FSH), and leutinizing hormone (LH), tissue necrosis factor 
(TNF), apoptosis factor- 1 and -2 (AP-1 and AP-2), and proteins and receptors that 
share 20% or more sequence identity to these, and the like, nucleic acids, including 
both DNA and RNA, saccharide complexes, and the like. 

For the step(s) of screening libraries of candidate target binding fragments or 
candidate cross-linked target binding fragments for members having the ability to bind 
to a target biological molecule of interest, a simple ELISA assay may be used to (a) 
identify member(s) of the library that are capable of binding to the target, and (b) 
determine the approximate K d with which the library member(s) bind to the molecular 
target. While ELISA assays are preferred for screening libraries of organic 
compounds, virtually any in vitro assay that allows one to detect binding of the target 
biological molecule by an organic compound may be employed for screening the 
library, wherein such assays include ELISA, other sandwich-type binding assays, 
binding assays which employ labeled molecules such as radioactively or fluorescently 
labeled molecules, fluorescence depolarization, calorimetry, protein denaturation, 
resistance to proteolysis, gel filtration, equilibrium dialysis, surface plasmin resonance, 
X-ray crystallography, and the like. Such assays either measure the ability of library 
members to bind directly to the target biological molecule or are competition binding 
assays designed to measure the ability of library members to inhibit the interaction 
between the target biological molecule and another molecule that binds to the target 
biological molecule. Any of the above assays may be employed to screen libraries of 
candidate compounds to identify those that bind to a target biological molecule. 

For the step of screening a library of candidate compounds to identify those 
that bind to a target biological molecule, it will be well within the skill level in the art 
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to determine the concentration of the library members to be employed in the binding 
assay. For the most part, the screening assays will employ concentrations of candidate 
compounds ranging from about 0.01 to 10 mM, preferably from about 0.05 to 5 mM. 
The step of pre-screening a library of candidate target binding fragments to 
5 identify those that bind to a target biological molecule allows one to identify and 
isolate only those members of the library that have some binding affinity for the target. 
As such, in contrast to standard combinatorial library approaches, the small organic 
building blocks are "pre-screened" to select a smaller set of compounds that have 
some binding affinity for the target. Thus, the most productive organic compound 
10 building blocks can be identified for incorporation into the potential high affinity 
candidate cross-linked target binding fragments that are prepared therefrom, without 
having to screen all possible combinations of all of the initial candidate target binding 
fragment building blocks. Generally, the candidate target binding fragment library 
members selected as building blocks for subsequently prepared candidate cross-linked 
1 5 target binding fragments are those that have the highest affinity for binding to the 
target biological molecule. For the most part, candidate target binding fragments 
chosen as building blocks for incorporation into the subsequently prepared candidate 
cross-linked target binding fragments are those that bind to the target biological 
molecule with a K d of about lOmM or less, usually about 5rnM or less, more usually 
20 about ImM or less, preferably about 500 |xM or less, more preferably about 100 \xM or 
less and most preferably about 50 fiM or less. However, for some applications, one or 
more of the candidate target binding fragment(s) chosen for incorporation into the 
subsequently prepared candidate cross-linked target binding fragments may have an 
individual K d for the target biological molecule of greater than 10 mM. 

25 

Once candidate target binding fragments that bind to a target biological 
molecule with some desired degree of affinity are identified, at least a portion of those 
compounds (or structurally related analogs thereof) are chemically coupled via a cross- 
linker to provide a library of candidate cross-linked target binding fragments for 
binding to the target biological molecule, wherein those candidate cross-linked target 
binding fragments comprise at least one candidate target binding fragment reacted or 
linked with a cross-linker. Usually two or more candidate target binding fragments (or 
structurally related analogs thereof) linked by a cross-linker are combined and in some 
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cases these two fragments are the same. The two (or more) candidate target binding 
fragments (or structurally related analogs thereof ) incorporated into a candidate cross- 
linked target binding fragment may be the same (i.e., to provide a homodimer or 
homomultimer) or different (i.e., to provide a heterodimer or heteromultimer). Most 
commonly, the two candidate target binding fragments in the candidate cross-linked 
target binding fragment are different. 

By "structurally related analog", "analog" and the like, is meant a fragment 
compound that has the same chemical structure as a fragment identified as being 
capable of binding to the target biological molecule except that the analog has a 
different chemically reactive functionality or linkage functional group(LFG) for 
binding to the cross-linker than does the fragment that was identified as being capable 
of binding to the target biological molecule in the first or pre-screen. The analog may 
also optionally possess or lack one or more substituents that are either lacking or 
present, respectively, on the fragments identified in the pre-screening provided that 
the presence or absence of those substituents does not substantially alter the 
compounds ability to bind to the target. As such, while one may pre-screen a library 
of, for example, candidate oxime compounds to identify candidate oxime fragments 
that bind to the target biological molecule, one can chemically couple not the actual 
oxime compounds identified in the "pre-screening" but rather aldehydes having that 
same chemical structures as the oximes identified in the screen (but which have an 
aldehyde reactive functionality rather than an oxime reactive functionality). The 
present invention, therefore, encompasses not only chemical coupling of the actual 
compounds identified in the initial "pre-screening step" (e.g., aldehydes are pre- 
screened and also subsequently linked), but also the chemical coupling of structurally 
related analogs of those compounds (e.g., oximes are pre-screened but the analogous 
aldehydes are linked). 

As described above, candidate target binding fragment will comprise a 
chemically reactive functionality or linkage functional group (LFG) (or a site that can 
be converted to a chemically reactive functionality (BLFG)) to which a cross-linker 
may be covalently bound, thereby providing a means for cross-linking two or more 
candidate target binding fragments having a LFG ( or analog or blocked form 
thereof) to provide a candidate cross-linked target binding fragment. Therefore, cross- 
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linkers that find use herein will be multifunctional, preferably bifunctional, cross- 
linking molecules that can function to covalently bond at least two fragment 
compounds together via their reactive functionalities or LFG's. Linkers or cross- 
linkers (XL) will have at least one, two or more and preferably only two, chemically 
cross-reactive functional groups (CFG) on the cross-linker (XL). The chemically 
cross- reactive functional groups that are available for bonding to two or more 
candidate target binding fragments, wherein those functional groups may appear 
anywhere on the cross-linker, preferably at each end of the cross-linker and wherein 
those chemically cross-reactive functional groups may be the same or different 
depending upon whether the candidate target binding fragment is to be linked have the 
same or different chemically cross-reactive functional groups. Cross-linkers that find 
use herein may be substituted or unsubstituted straight-chain or branched alkyl, aryl, 
alkaryl, heteroaryl, heterocycle and the like. Preferably straight chain alkyl will 
generally be at least one methylene in length, more generally from 2 to 8 methylenes 
in length, and optionally as many as about 12 or more methylenes or the equivalent in 
length. Cross-linkers may include atoms or groups to increase or improve solubility of 
the library members, such as oxygen atoms interspersed between methylene groups 
creating ether or polyether linkers. Cross-linkers will generally comprise alkyl 
groups either saturated or unsaturated, and therefore, may comprise alkanes, alkenes or 
alkynes. Heteroatoms including nitrogen, sulfur, oxygen, and the like may also be 
appended to the alkyl to form groups such as; alkoxyl, hydroxyalkyl or hydroxy 
groups. Other cross-linking groups such as aryl, especially phenylene or substituted 
phenylene linkers are suitably employed. Usually cross-linker elements will be of 
varying lengths, thereby providing a means for optimizing the binding properties of a 
cross-linked target binding fragment prepared therefrom. 

In particularly preferred embodiments, cross-linkers may be O.O'-diamino- 
alkanediol compounds, preferably O.O'-diamino-C.-C^lkanediol, which are useful for 
chemically coupling aldehyde organic compounds, or any of a variety of different 
diamine compounds, which are useful for chemically coupling aldehyde containing 
compounds. 

Various chemistries may be employed for chemically coupling candidate 
target binding fragments via a cross-linker to provide candidate cross-linked target 
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binding fragments for binding to a target biological molecule. For example, many 
well known chemistries that can be employed for chemically coupling candidate target 
binding fragments(s) via a linker to form candidate cross-linked target binding 
fragments include, for example, reductive aminations between aldehydes and ketones 

5 and amines (March, Advanced Organic Chemistry, John Wiley & Sons, New York, 
4th edition, 1992, pp.898-900), alternative methods for preparing amines (March et al., 
supra, p. 1276), reactions between aldehydes and ketones and hydrazine derivatives to 
give hydrazones and hydrazone derivatives such as semicarbazones (March et al., 
supra, pp. 904-906), amide bond formation (March et al., supra, p. 1275), formation of 

3 ureas (March et al., supra, p.4299), formation of thiocarbamates (March et al., supra, 
p.892), formation of carbamates (March et al., supra, p. 1280), formation of 
sulfonamides (March et al., supra, p. 1296), formation of thioethers (March et al., 
supra, p. 1297), formation of disulfides (March et al., supra, p. 1284), formation of 
ethers (March et al., supra, p. 1285), formation of esters (March et al., supra, p.1281), 
additions to epoxides (March et al., supra, p.368), additions to aziridines (March et al., 
supra, p.368), formation of acetals and ketals (March et al., supra, p. 1269), formation 
of carbonates (March et al., supra, p.392), formation of enamines (March et al., supra, 
p.1284), metathesis of alkenes (March et al., supra, pp.1 146-1 148 and Grubbs et al., 
Acc. Chem. Res. 28:446-452 (1995)), transition metal-catalyzed couplings of aryl 
halides and sulfonates with alkenes and acetylenes (e.g., Heck reactions) (March et al., 
supra, pp. 717-178), the reaction of aryl halides and sulfonates with organometallic 
reagents (March et al., supra, p.662), such as organoboron (Miyaura et ah, Chem. Rev., 
95:2457 (1995)), organotin, and organozinc reagents, formation of oxazolidines (Ede 
et al., Tetrahedron Letts. 38:7119-7122 (1997)), formation of thiazolidines (Patek et 
al., Tetrahedron Letts. 36:2227-2230 (1995)), amines linked through amidine groups 
by coupling amines through imidoesters (Davies et al., Canadian J. Biochem. 50:416- 
422(1972)), and the like. 

The step of chemically cross-linking, via a cross-linker, at least a portion of the 
candidate target binding fragments identified as described above as being capable of 
binding to the target biological molecule or structurally related analogs thereof 
provides a library of candidate cross-linked target binding fragments for binding to the 
target molecule that comprise al least two of the candidate target binding fragments 
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or analogs thereof and the cross-linker. As previously stated the candidate target 
binding fragments incorporated into candidate cross-linked target binding fragments 
may be the same, thereby providing a homomultimer, or different, thereby providing a 
heteromultimer, and libraries of candidate cross-linked target binding fragments 
5 generally comprise both homo- and hetero-multimers. Candidate cross-linked target 
binding fragments for binding to the target molecule are preferably dimeric, however, 
candidate cross-linked target binding fragments that find use may also be trimeric, 
tetrameric, and the like, those compounds being obtained by employing cross-linkers 
having more than two chemically cross-reactive functional groups for cross-linking 
10 purposes. Candidate cross-linked target binding fragments for binding to a target 
biological molecule that find use herein will generally be less than about 1000 daltons 
in size and often less than about 750 daltons in size. 

Libraries of candidate cross-linked target binding fragments for binding to the 
target biological molecule will generally comprise at least 1 candidate cross-linked 
1 5 target binding fragment, usually at least about 20 different candidates, more usually at 
least about 100 different candidates, preferably at least about 200 different candidates, 
more preferably at least about 500 different candidates, most preferably at least 1,000 
different candidates and often 10,000 or more. Libraries of candidate cross-linked 
target binding fragments may be constructed such that each individual molecule of the 

20 library may be spatially separated from the other molecules of the library (e.g., each 
member of the library is in a separate microtiter well) or two or more members of the 
library may be physically combined if methods for deconvolution are readily available. 

Once obtained, libraries of candidate cross-linked target binding fragments for 
binding to the target biological molecule will be screened for the purpose of 

25 identifying a member(s) of the library that is/are capable of binding to the target 
biological molecule with high affinity. For such purposes, any of the above described 
screening assays can be employed, wherein preferably a biological assay such as an 
ELISA assay is employed. 

For the step of screening a library of candidate cross-linked target binding 

■*° fragments to identify one or more that bind to a target biological molecule, it will be 
well within the skill level in the art to determine the concentration of the compounds to 
be employed in the binding assay. We have herein found that candidate cross-linked 
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target binding fragments generated by chemically coupling organic compounds that 
bind to a target biological molecule often exhibit surprisingly high binding affinities 
for the target. For the most part, candidate cross-linked target binding fragments that 
serve as potential drug lead compounds or may have therapeutic efficacy on their own 
5 bind to the target biological molecules with a K d of about 500 nM or less, usually 
about 100 nM or less, more usually about 50 nM or less. However, for various 
applications, one or more drug lead compound(s) having an individual K d for the target 
biological molecule of greater than 500 nM may also find use. 

Another embodiment of the present invention is directed to a method for 
) inhibiting the interaction between first and second biological molecules which bind to 
each other, wherein the method comprises contacting a system comprising those 
molecules with a binding inhibitory amount of a candidate cross-linked target binding 
fragment or drug lead compound identified by the method described above, wherein 
the drug lead compound or cross-linked target binding fragment binds to the first 
biological molecule and inhibits its ability to bind to the second biological molecule. 
For the most part, the first and second biological molecules will be proteins, nucleic 
acids, saccharide complexes, and the like, preferably at least one being a protein, more 
preferably both being proteins. In particularly preferred embodiments, the first or 
second biological molecule may be CD4 or gp!20. In other preferred embodiments, 
the first biological molecule may be a protein wherein the second biological molecule 
is a receptor for that protein, a nucleic acid, either DNA or RNA, that binds to that 
protein or a polysaccharide or the first biological molecule is an enzyme wherein the 
second biological molecule is a substrate for that enzyme. 

Systems that comprise both the first and second biological molecules may be 
either in vitro or in vivo, wherein the first and second biological molecules are situated 
such that they are capable of binding to one another. For in vivo applications, the lead 
of interest may be administered on its own or in pharmaceutical^ acceptable media, 
for example normal saline, PBS, etc. The additives may include bactericidal agents, 
stabilizers, buffers, or the like. In order to enhance the half-life of the drug lead 
compounds in vivo, the compounds may be encapsulated, introduced into the lumen of 
liposomes, prepared as colloids, or another conventional technique may be employed 
that provides for an extended lifetime thereof. 
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The drug lead compounds may be administered as a combination therapy with 
other pharmacologically active agents or may physically linked to such agents or other 
carriers. Various methods for administration may be employed. The drug lead 
compounds may be given orally, or may be injected intravascularly, subcutaneously, 
5 peritoneally, etc. A "binding inhibitory amount" of the drug lead compounds will 
vary widely, depending upon the nature of the first and second biological molecules, 
the frequency of administration, the manner of administration, the clearance of the 
compound from the host, and the like. Appropriate binding inhibitory amounts may 
be determined empirically by those skilled in the art in a routine manner. 
1 0 candidate cross-linked target binding fragments 

II. Definitions and preferred embodiments 

In its broadest embodiment, the method of this invention comprises: 
assembling a library of candidate target binding fragments; screening the library of 

15 candidate target binding fragments for those that bind to a target molecule; cross- 
linking target binding fragments to produce a library of candidate cross-linked target 
binding fragments; screening the library of candidate cross-linked target binding 
fragments for those that bind to the target molecule. The product of this method is 
referred to as a lead pharmaceutical or drug candidate. 

0 More specifically (see Figure 9), the method of this invention is used to 

identify lead pharmaceutical drug candidates and optionally involves the following 
simple steps (a-h). 

(a) Assembling a library of candidate target binding fragments each 
fragment having a linkable (or linkage) functional group (LFG) or the blocked 
5 form thereof (BLFG), the blocked form containing linking group (LG); 



Assembling a library as used herein means any method of selecting two or more 
molecules to form a library for use in the method of this invention. Preferably the 
library will be large, greater than 50 members, and contain a diverse array of target 
interactive groups capable of forming non-covalent bonds, e.g. hydrogen, Van der 
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Waals, electrostatic, hydrophobic and the like with the target molecule. Such 
interactive groups'may include functional groups found on naturally occurring amino 
acid sidechains, carbohydrates, lipids, nucleic acids and their metabolites and 
derivatives thereof, or groups found on known pharmaceuticals. Optionally, a library 
may be customized to contain interactive groups known or suspected to interact with 
binding sites on the biological target molecule or its biological ligand. 

Candidate target binding fragments (CTBF) are small water soluble organic 
molecules having a molecular weight of about 200da (including the LFG) capable of 
forming a non-covalent complex with a target biological molecule (TBM). This 
complex may be of low affinity having a Kd as low as about 5mM. The CTBF's are 
commercially available or may be synthesized by known procedures. 

Linkable (or linkage) functional groups (LFG) include any functional groups 
capable of reacting with a chemically cross-reactive functional group (CFG) on a 
cross-linker (XL) thereby forming a stable covalent bond with the cross-linker. This 
covalent bond is referred to simply as a linking group (LG). When the linked 
molecule contains more than one linking group an integer following LG is used to 
indicate the number of the LG in the molecule. The LFG includes blocked, protected 
or otherwise transformed groups that may or may not react directly with the CFG on 
the cross-linker. This blocked form of the linkable functional group (BLFG) is often 
the preferred form of the CTBF because it may be less likely to form covalent linkages 
with the target biological molecule in the contacting step below. 



In the case where the BLFG is to be used in the contacting step, the de-protected 
or re-transformed form of the CTBF that it is capable or reacting with the CFG on the 
cross-linker is the form used in the cross-linking step below. By way of illustration, a 
CTBF may contain an aldehyde as a LFG or this aldehyde may be protected or 
transformed by reacting it with an O-amino alcohol to form an oxime (BLFG) as 
shown below. 



CTBF 



BLFG 
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In this case the oxime would be considered the BLFG. It will be appreciated that 
more than one reaction or transformation to an LFG may be made. By way of further 
illustration an aldehyde may be reacted with an amine to form a Schiff base, which in 
turn may be reduced to a secondary amine. This may be still further reacted to form an 
5 amide, sulfonamide, urea, carbamate etc. All these transformations of the initial 
aldehyde are also considered BLFG's. 

In the case where the CTBF (including the LFG) is first transformed or 
protected, the initial CTBF is sometimes referred to as a "precursor" while the 
transformed or protected form that is contacted with the TBM is referred to as a 
10 "monomer". Thus in the reaction illustrated above, the aldehyde may be called a 
precursor while the oxime is referred to as a monomer. In this case the oxime covalent 
bond (=N-0-) is referred to as the linking group (LG). It is often preferred that the 
monomer contain the same linking group (LG) as is present in the cross-linked form, 
described below, because some of the binding energy with the target may come from 
15 LG. 

Examples of linkage or linkable functional groups include; aldehyde, ketone, 
primary amine, secondary amine, epoxide, carboxylic acid, sulfonic acid, alcohol 
(hydroxyl), isicyanate, isothiocyanate, halide and sulfonate. These fuctional groups 
may act as precursors of the blocked linkage functional groups. 

20 Examples of CTBF's having blocked linkage or linkable functional groups are 

molecules containing Linkage Groups selected from; oxime, hydrazone, N-acyl 
hydrazone, secondary amine, tertiary amine, acetal, ketal, 1,2 amino alcohols, amide, 
N,N-disubstituted amides, thioamide, ureido, thioureido, carbamate, thiocarbamate, 
thiothiocarbamate, sulfonamide, carbamate, guanidino, amidino, thioester, ester, ether, 

25 2-hydroxyether, 2-hydroxythioether, thioether, disulfide, alkane (alkylene), alkene 
(alkenylene) and alkyne (alkynylene). Prefered monomers will contain the above 
functional groups as LG's. 

Steps b-d below may be combined into a single screening step that may be referred 
to as a first screening step, a pre-selection step or a pre-screening step. 

(b) Contacting the candidate target binding fragments with a target 
biological molecule (TBM). 
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Contacting of the TBM with one or more members of the library may be 
conducted either individually or multiply. Preferably each candidate target binding 
fragment is contacted individually with the TBM. For example, this may conveniently 
be done in a 96 well format plate so that the formation of a complex with each member 
5 of the library can be conveniently evaluated without requiring any deconvolution step. 
The contacting step is often conducted at relatively high concentrations of the CTBF, 
so that Kd's as low as about 5 mM can be measured (see below). 

The Target Biological Molecule (TBM) may be any biological molecule 
preferably of mammalian and most preferably of human origin. Optionally preferred 
10 TBM's may be human pathogen proteins such as viral proteins from viruses that infect 
human cells. Preferred TBM's are proteins most preferably secreted proteins. 
Preferred secreted proteins include; enzymes, cytokines, hormones, growth factors and 
their receptors. The TBM's may be isolated from natural sources or made 
recombinately in a host cell. Normally the atoms of the TBM will contain the natural 

1 5 abundance isotopes, but in some circumstances may be enriched. When the target is a 
receptor or a cell surface bound molecule, the TBM may conveniently be the 
extracellular domain or a derivative thereof. 

In a preferred embodiment of the method, the Target Biological Molecule is 
not a single biological molecule; rather it is two or more molecules. For example, 

20 preferred TBM's may be protein-protein, protein-DNA/RNA, protein-substrate pairs. 
By way of illustration, a ligand-receptor pair may be the actual target when an ELISA 
or other biological assay requiring two or more biological molecules is used to 
measure a physical association (see below) such as binding or activity. The binding 
constant, IC50 or other measurement may result from the CTBF binding with either 

25 the Ligand, receptor or both. Similarly, when selected fragments are cross-linked (see 
below) and re-screened against the ligand-receptor pair, the recombined fragments 
may bind with the ligand, receptor or both. 

(c) Measuring a change in a first physical association (PA-1) of the 
target biological molecule. 
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Measuring a change as used herein means any method capable of quantifying a 
physical association, including binding, of the CTBF with the TBM. 
A physical association (PA-1) of the target molecule includes a biological property 
such as binding with another biological molecule, signal transduction or catalysis of a 
5 reaction. It may also include any measurable physical chemical property such as a 
spectroscopic or magnetic property. Preferably the physical association will be 
suitable for rapid high throughput screening. The most preferred physical association 
measurement will be a biological one such as measuring binding or catalysis. An 
example of binding measurement would be an ELISA assay where protein-protein 
10 antagonism is measured. 

(d) Selecting target binding fragments (TBF) based on (c). 

Selecting target binding fragments (TBF) is based on the physical association 
1 5 measurement step. Selected TBF's will include those that bind relatively weakly with 
the TBM. Thus, for example in a binding assay such as ELISA, fragments that bind 
with no greater than a 5 mM affinity may be selected for. Most commonly the first 
selected CTBF's will bind to the target with an affinity of from 2 mM to 100 uM. 
TBF's or monomers that bind with a higher affinity e.g. Kd < 50 ixM are preferred, 
however such relatively high binding affinities are not necessary for selection for the 
cross-linking step. 



(e) Reacting target binding fragments with a cross-linker, having 
chemically compatible cross-reactive group(s) with the LFG, under conditions 
suitable for forming a library of candidate cross-linked target binding fragments 
(CXL-TBF). 



20 



25 



The selected target binding fragments, which individually normally bind to the 
30 target with relatively low affinity, are then cross-linked, normally in al! combinations 
and permutations, to produce a library of candidate cross-linked target binding 
fragments (CXL-TBF). By way or illustration, If TBF,„ and TBF„ are selected from 
the first screen, these molecules are then reacted with a suitable cross-linker such as a 
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bifunctional linker (BFL) to form candidate cross-linked target binding fragments 
(CXL-TBF) for the second screening step according to the general equation: 



10 



15 



20 



25 



.10 



TBF m 


— LFG + 


TBF„ 


—LFG + 


BFL 



CXL-TBF 



The BFL above will normally have two cross-reactive groups that are 
compatible with both LFG's, that is they will form stable covalent bond(s) (LG) with 
the LFG's under selected reaction conditions. 

Each TBF may consist of two (or three) parts or moieties when LFG contains 
an atom capable of forming two (or three) bonds other than the linking group (LG) 
bond formed with the cross-linker. For example, when LFG is the carbonyl of a 
ketone, the carbonyl carbon may be bonded to two alkyl, aryl etc parts or moieties 
(part A and part B or part C and part D). This case may be represented generally by 
the diagram below where, for example, two ketones are cross-linked with an O.O'- 
diamino-alkanediol cross-linker to form a di-oxime: 



TBF m -part A 



TBF n -part C 



LG5- | XL | — LG 4 



TBF m -part B 



TBF n -part D 



Here each of the two alkyl, aryl etc parts or moieties from the ketone are bonded to a 
nitrogen atom in the linking group (LG, =N-0-). 

The CXL-TBF's are sometimes referred to as "dimers", however, they are truly 
only dimers when the cross-linking moiety (XL) is only a chemical bond. In the most 
general case these "dimers" will contain a linking moiety, such as an alkane, alkene, 
arylene, alkyl ether and the like bonded through one or more linking groups (LG) to 
the corresponding TMF's. 

Reacting means chemically reacting so that a stable covalent bond or linking 
group LG is formed as the reaction product between the linkable functional group on 
the target binding fragment(s) and a chemically cross-reactive functional group (CFG) 
on the cross-linker. This step may be referred to as a cross-linking step or sometimes a 
"combination" or "recombination" step. 
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15 



20 



25 



30 



Compatible functional group herein means capable of reacting with to form a 
stable LG. 

The cross-linker is a small organic molecule having at least one cross-reactive 
functional group capable of reacting with the linkable functional group of at least one 
of the target binding fragments. Commonly, the cross-linker will have 2-4 such cross- 
reactive functional groups and most commonly 2 such groups. In this case the cross- 
linker is referred to as a bifunctional linker (BFL), either homo-bifunctional or hetero- 
bifunctional depending on whether the compatible functional groups are the same or 
different from one another. 

In the case where the cross-linker has a single cross-reactive functional group, 
the candidate cross-linked target binding fragment, CXL-TBF, is simply the reaction 
product of these two molecules: 

' LG-XL (or simply CXL-TBF) 



TBF m — LFG 1 + XLp — CFG 1 



CTBF 



A simple example of such a reaction would be the product of an aldehyde and 
an amine. In this exemplary case Rl is a TBF and R2 Is XL where LFG-1 is an 
aldehyde and CFG-1 is a primary amine: 



H 



Ri 



H 



In this case LG is the resulting secondary amine, -(NH)-. 

In the case where the cross-linker is a bifunctional linker, BFL, the linker has 
two cross-reactive linking groups and may be represented generally as: 

CFG 1 — | XL | — CFG 2 

In this case, reacting two target binding fragments from the first screen above, each 
with its own linking functional group with the above BFL may be represented: 

f CFG, — | XL [ — CFG 2 



TBF m — LFG, 



TBF n 



LFG, 



TBF„ 



-LGt— 



XL 



] LG H! 



TBF n 



In the most general case of the above reaction, the two target binding fragments 
will have different linkable functional groups and will be linked by a hetero- 
bifunctional linker (het-BFL). In the most common case, both TBF will have the same 
CFG and will be linked by a homo-bifunctional linker (homo-BFL). In some cases, 
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target binding fragments having different linkable functional groups may be linked 
with a homo-BFL and target binding fragments having the same linkable functional 
groups may be linked with a het-BFL. 

Higher order linking groups such as tri- and tetra-functional linkers may also 
5 be employed in the method of this invention. By way of illustration a Afunctional 
linker such as that shown below may be used: 




In each of the above cases, the product of the cross-linking reaction produces a 
candidate cross-linked target binding fragment(s) (CXL-TBF). These candidate 
molecules, typically with a molecular weight of 400 - 600 daltons, are then screened a 
1 5 second time against the target biological molecule. 

Steps f-h below may be combined into a single screening step that may be 
referred to as a second screening step, a selection step or a final screening step. 

(0 Contacting the candidate cross-linked target binding fragments 
20 with the target biological molecule. 

Contacting of the TBM and one or more members of the library of candidate 
cross-linked target binding fragments (CXL-TBF) may be conducted either 
individually or multiply as before. Preferably each candidate cross-linked target 

25 binding fragment is contacted individually with the TBM. The contacting step is 
usually conducted at lower concentrations of the CXL-TBF compared to CTBF's in 
the first screening step, so that Kd's in the micromolar or nanomolar range can be 
measured. The contacting format may be the same or different from that in the first or 
pre-screening step. Thus for example the two contacting steps may both be part of a 

30 binding assay (e.g. ELISA) or, for example, the second contacting step may be a 
functional activity assay or cell based binding assay. 



(g) measuring a change in a second physical association (PA-2) of the 

- 32 - 



WO 99/49314 



PCT/US99/06734 



target biological molecule. 

The second physical association (PA-2) measurement for the CXL-TBF's may be 
the same as that used for the CTBF's or may be a different physical measurement. 
5 Preferably the second physical association measurement will be a biological 
measurement rather than a physical chemical measurement such as spectroscopic or 
the like. 

(h) selecting cross-linked target binding fragments (XL-TBF) based of 

(g)- 

10 

The cross-linked target binding fragments (XL-TBF) selected will typically 
have a Kd, IC-50 or the equivalent of 500 nM or better. These XL-TBF' s are useful as 
drug lead pharmaceutical candidate molecules. 

15 m Specific Chemistry 

Many chemistries may be employed to produce linkable functional groups or to 
block or derivitize them. Similarly a wide array or linking chemistries are possible. 
Described below are a number of chemistries suitable for rapid high throughput 
screening. This description is meant to be illustrative and not limiting. In the 
20 description below the term "CTBF" is equivalent to "CTBF-part A" plus " CTBF-part 
B" defined above. Similarly, Linkable functional group is used interchangeably with 
linkage functional group. 

The term "alkyl" means a cyclic, branched or unbranched saturated or 
unsaturated hydrocarbon radical having the number of carbons specified, or if no 
25 number is specified, up to 12 carbon atoms. 

The term "aryl" means a homocyclic aromatic hydrocarbon radical having 
from 6-14 carbon atoms. Examples include phenyl, napthyl, biphenyl, phenanthrenyl, 
napthacenyl and the like. 

The term "heteroaryl" means a heterocyclic aromatic radical having from 4-13 
10 carbon atoms and from 1-6 heteroatoms selected from N, O, S and P. 

The term "heterocycle" means a saturated or partially unsaturated cyclic radical 
having from 3-13 carbon atoms and from 1-6 heteroatoms selected from O, S, N and P. 
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The term "alkoxy" means an alkyl radical, as defined above, substituted with 
an oxo radicai~(-0-). 

The term "acyl" means alkanoyl or alkylcarbonyl having from 1-12 carbon 

atoms. 

5 The term "carboxy ester" means acyloxy or alkanoyloxy having from 1-12 

carbon atoms. 

The term "carboxamide" means alkylcarbonylamino having from 1-12 carbon 

atoms. 

GENERAL NOTE: When CTBFs are employed with BLFGs, they may be 
10 prepared as described beloiv from CTBFs containing the corresponding LFGs. 
Alternatively, it may be practical to purchase, otherwise acquire, or prepare by known 
methods the CTBFs with BLFGs using alternative chemistry. 



15 



Candidate Target Binding Fragments or Molecules (CTBFs) 



L Aldehyde and ketone as the linkage functional groups and the corresponding 
blocked linkage functional groups 

(a). The linkage functional group is the carbonyl group present in an aldehyde 
or a ketone. These CTBFs may be available commercially, or may be prepared by a 
20 variety of known methods to those practiced in the art. 
Aldehyde CTBFs are represented as follows. 



CTBF 



— CHO 



Ketone CTBFs are represented with two different parts (A and B) of the CTBF 
25 attached to ketone carbonyl as shown below. 



CTBF-part A 



CTBF-part B 



30 



(b). Alternatively, the library is assembled with the oxime group as the 
blocked linkage functional group (BLFG). These CTBF's are prepared by 
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condensation of aldehydes or ketones LFG's with an O-substituted hydroxylamine. 
R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, heterocycle, carboxy 
5 ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



10 



15 



CTBF — CHO 4- H 2 N-OR 8 



CTBF 



jOR 8 

IF 

-CH 



CTBF-part A 



or 



CTBF-part A 



CTBF-part B 



N-OR 8 



CTBF-part B 



20 



(c). Alternatively, the library is assembled with the hydrazone group as the 
BLFG. These CTBFs are prepared by condensation of aldehydes or ketones LFGs 
with an ^-substituted hydrazine(March, Advanced Organic Chemistry, John Wiley & 
Sons, New York, 4 th edition, 1992, pp. 904-906). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
25 or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

,NHR 8 



CTBF 



-CHO 



H 2 N— NHR 8 



CTBF 



-CH 



30 



or 



CTBF-part A 



+ H 2 N— NHR 8 



CTBF-part A 



N-NHR 8 



CTBF-part B 



CTBF-part B 
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(d). Alternatively, the library is assembled with the N-acylhydrazone group as 
the BLFG. These CTBF's are prepared by condensation of aldehydes or ketone LFG's 
and an N-acyl hydrazine. Many reaction conditions are known to those practiced in 

5 the art (e.g., [March et al. supra, pp. 905-906] and [Li et al. Chem. Biol, 1: 37 
(1994)]). X may be nothing, O, S, NH or NR 9 . R« or R 9 may be H, or a straight chain 
or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R« or R 9 may also be appended with up to 
five RlO groups (R™ is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 

5 acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R« or R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 

LJ 

XR 8 

_ ^_ If 

XR 8 



20 



CTBF — CHO 



H 



— ,Of 

CTBF — CH 6 



or 



| CTBF-part A \ -^ P 



HjN 



k N^XR 8 



CTBF-part B 



j CTBF-part A 



4 



o 



CTBF-part B 



(e). Alternatively, the library is assembled with an amine group as the BLFG. 
These CTBF's are prepared by reductive amination of aldehydes or ketone LFGs. A 
large number of reducing agents could be employed that are known to those practiced 

25 in the art (March, et al. supra, pp. 898-900). R« or R 9 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R« or R 9 may also be appended with up to 
five RlO groups (R™ i s alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 

3q acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R« or R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 
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CTBF — CHO + 



8 B 9 



CTBF-part A 



HNR B R 



8 0 9 



reducing agent 



CTBF — CH 2 



HNR B R 



or 



reducing agent 



CTBF-part A 



NR 8 R 9 
H 



CTBF-part B 



CTBF-part B 



(0- Alternatively, the library is assembled with an acetal or ketal group as the 
BLFG. These CTBF's are prepared by condensing the aldehyde or ketone LFG's with 
a diol. Conditions for the preparation of acetals or ketals are known to those practiced 
in the art (March, et al. supra, pp. 889-891). R 8 may be a straight chain or branched 
alkyl group of length 2 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). R 8 may be a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —CHO + 



HO-R^OH 



CTBF 



-c-o 

H 



or 



CTBF-part A 



HO-Ra-OH 



CTBF-part A 



CTBF-part B 



CTBF-part B 



(g). Alternatively, the library B LFG's may be prepared by condensing the 
aldehyde or ketone LFGS with an amino alcohol or an amino thiol. Methods to prepare 
the product oxazolidines and thiazolidines are known to those practiced in the art (e.g., 
oxazolidines: Ede, et al. Tetrahedron Letters, 38: 71 19-7122 (1997), and thiazolidines: 
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Patek et al. Tetrahedron Letters, 36:2227-2230 (1995)). R 8 or may be a straight 
chain or branched alky 1 group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). R 9 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 9 may also be 
appended with up to five R 1 * groups (R^ is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF — CHO + HO— R 8 — NHR 9 



9-1 



8 



CTBF — C-NR 9 



or 



CTBF-part A | — + HO-RHMHR 9 — >-| CTBF-part A 



O-R* 
■NR 9 



CTBF-part B | [ CTBF-part B 



(h). Alternatively, the library is assembled with the alkene group as the 
blocked linkage functional group (BLFG). These CTBF's are prepared by 
condensation of aldehydes or ketone LFG's with phosphorous ylides (Maryanoff et al. 
Chemical Reviews, 89, 863-927 (1989). R 8 and R 9 may be H or a straight chain or 
branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R !0 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 and R 9 may also be halogens and heteroatoms. 



- 38 - 



WO 99/49314 



PCT/US99/06734 



1ctbf[-cho + LnR ^ c ^ R 



7 



>- R9 

ctbfJ-ch 



CTBF-part A 



CTBF-part A | 



I 

R 9 



| CTBF-part A| (jf 

[CTBF-part A 



15 



(i). Alternatively, carbanions, usually stabilized carbanions, may also be added 
into the carbonyl. Either the alcohol product A is obtained, or the hydroxyl group is 
eliminated to provide an alkene B. Numerous methods are available for performing 
this transformation (March et al., supra, 937-950). ). R 8 and R 9 may be H or a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
20 carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 and R 9 may also be halogens and heteroatoms. 



25 



H. 



CTB F— CH O + Q — R 



r 



base 



T 



a //f 

CTBF] — CH 



I R 9 



HQ I I 

30 . . R 8 ^ , . \ .CH— R 9 _ 

|CTBF-part A| ♦ ^ C ^ R » base » jCTBF-part A| CT | CTBF-part A] <p 

ICTBF-parfAl ^ |CTBF-partA] 1 CTBF-part 7T| 



A B 
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2. Primary and secondary amines as the linkage functional groups and the 
correspondirigljldcked linkage functional groups . 

(a). The linkage functional group is the basic nitrogen of primary or secondary 
amines. These CTBF's may be available commercially, or may be prepared by a 
5 variety of known methods to those practiced in the art. CTBF's that have a primary 
amine LFG's are represented as follows. 



CTBF 



-NH 2 



CTBF's that have a secondary amine LFG's are represented with two different parts 
(A and B) of the CTBF attached to the amine group as shown below. 



CTBF-part A 



NH 



CTBF-part B 



(b). Alternatively, the library may be assembled as secondary or tertiary amine 
BLFGs. 

(i). CTBF's with secondary or tertiary amine B LFG's may be prepared 
by reductive amination of primary amine or secondary amine LFG's, respectively, 
with aldehydes and ketones. A large number of reducing agents could be employed 
that are known to those practiced in the art (March, et al. supra, pp. 898-900). R 8 or 
R 9 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also 
be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 or R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). Numerous reductive amination methods may be used that are 
known to those practiced in the art. 

^H-NH 2 + R*COR° Reducing A 9 ent > | Pharmacophore 



or 
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| CTBF-part A Hf" + r9 cQr8 

| CTBF-part B | 



Reducing Agent 



• | CTBF-part A \ ~ N 



pHR 8 R 9 



CTBF-part B 



10 



15 



20 



(ii). Alternatively, CTBF's with secondary or tertiary amine BLFG's 
5 may be prepared by reaction of primary or secondary amine LFG's, respectively, with 
an aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be 
a halide or a sulfonate group (OSO2R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH3, CF3, phenyl-CH3 and phenyl-N02>. The halide could be attached to 
aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an StfAr reaction or a palladium-mediated (or related transtion metal mediated) amine 
coupling reaction would be performed [e.g, March, et al. supra, pp. 656-657; Wagaw 
et al., /. Am. Chem. Soc, 119: 8451-8458 (1997) and references therein; and Ahman et 
al. Tetrahedron Letters, 38: 6363-6366 (1997)]. Where X is substituted upon alkyl 
functionality, an Sn2 or SnI reaction would be performed (March, et al., supra, pp. 
411-413). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



25 



CTBF — NH 2 



X-R 8 



? 8 

CTBF h-IMH 



or 



CTBF-part A 



JR 8 



— NH + X-R 8 



CTBF-part A 



CTBF-part B 



i 



CTBF-part B 



(iii). Alternatively, CTBF's with secondary or tertiary amine BLFG's 
may be prepared by reaction of primary or secondary amine LFG's, respectively, with 
a substituted epoxide (March, et al., supra, p. 416). Many epoxides are available 
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commercially. Alternatively they can be prepared by a number of known methods to 
those practiced-in the art, most preferably by epoxidation of an alkene. R 8 to R 11 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 to R 11 may 
5 also be appended with up to five R 12 groups (R 12 is alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 to R 11 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



10 



25 



.10 



CTBF — NH 2 + 



CTBF 



— NH R 



R io 
11 



or 



15 1 CTBF-part A ) — t|JM + R g \u ■ 



CTBF-part A 



R 11 



CTBF-part B 



| CTBF-part B 



(c). Alternatively, the library may be assembled as amide or thioamide 
BLFG's. These CTBF's may be prepared by coupling primary and secondary amine 
LFG's with carboxy lie acids (X = H), carboxylic acid derivatives (X = OR, SR, 
20 halide), or the corressponding thiocarboxylic acid derivatives (X = OR, SR, halide). 
Numerous methods are also available for coupling carboxylic acids and carboxylic 
acid derivatives with amines and are known to those practiced in the art [e.g., March, 
et al., supra, pp. 417-425; and Fields et al. Int. J. Peptide Protein Res. 35:181-187 
(1990)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acyl amino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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CTBF — NH 2 + 



CTBF-part A 



>C^R 8 

z = o,s 

CTBF-part B I Z = O.S 



CTBF I NH 

Z = O.S 



or 



| CTBF-part A f-y 



CTBF-part B 



(d). Alternatively, the library may be assembled as urea or thiourea BLFG's. 

(i). These CTBFs may be prepared by condensation of primary or 
1 0 secondary amine LFGs and isocyanates or isothiocyanates. The direct coupling of 
isocyanates and isothiocyanates with amines is known to those practiced in the art 
(March, et al., supra, p. 903). R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, TV-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



15 



20 



CTBF — NH 2 + 



Z=C=N— R 8 
Z =0, S 



CTBF 



or 



25 



CTBF-part A — NH 



CTBF-part B 



Z=C=N— R 8 
Z =O.S 



CTBF-part A 



H H 

z = o,s 



CTBF-part B 



(ii). Alternatively, these CTBF's may be prepared by a two step 
process. In the first step, the primary or secondary amine LFG's is converted to 
10 carbamate (thiocarbamates) or related derivatives where X and Y are alkoxy groups, 
mercaptyl groups, halides, or other suitable leaving groups. In the second step a 
primary or secondary amine is added (e.g., Hutchins, Tetrahedron Letters, 35: 4055- 
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4058, (1994) and references therein). R 8 and R 9 may be H, or a straight chain or 
branched alkyPgroups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up 
to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also 
be aryl or heteroaryl groups that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino AT-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



(D 



(2) 



CTBF 



-NH> + a Y 

z = o,s 



CTBF h-fsj^ y 
H 



A. 



CTBF 



1. 



R 8 

+ NH 

H | 

z = o,s R 9 



CTBF 



Z = O.S 



or 



0) 



(2) 



CTBF-part A 



-NH 



CTBF-part B 



CTBF-part A 



CTBF-part B 



z = o, s 



R 9 



CTBF-part A 



CTBF-part B 



CTBF-part A 



CTBF-part B 



(iii). These CTBF's may also be prepared by condensation of primary 
or secondary amine LFG's and carbamates, thiocarbamates, or related derivatives 
where X is an alkoxy group, a mercaptyl group, or a halide (e.g., Hutchins, 
Tetrahedron Letters, 35: 4055-4058, (1994) and references therein). R 8 and R 9 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be aryl or heteroaryl groups that are optionally 
subslituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino /V-acylamino, 
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alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



-NH, 



xr n 

R 9 

Z = 0,S 



CTBF 



.X...R' 



2 = O.S 



H R» 



or 



CTBF-part A 



+ xAjj' r8 



CTBF-part B 



z = o,s 



CTBF-part A I — N 7 n 

1 X R 9 



| CTBF-part B 



(d). Alternatively, the library may be assembled as sulfonamide BLFG's. 
These CTBF's may be prepared by coupling primary and secondary amine LFG's with 
sulfonic acids (X = H) or sulfonic acid derivatives (X = OR, SR, halide). Numerous 
methods are available for coupling sulfonic acids and sulfonic acid derivatives with 
amines and are known to those practiced in the art (e.g., March, et al., supra, p. 499, 
Greene, et al. Protective Groups in Organic Synthesis, John Wiley & Sons, New York, 
2 nd edition, 1991, pp. 379-385). R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is halide, 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — NH 2 + 



CTBF-part A — NH 



or 



1 V 

CTBF h N - s -R° 

H 



CTBF-part A rB 



CTBF-part B 



CTBF-part B 



(e). Alternatively, the library may be assembled as carbamate, thiocarbamate 
or related BLFG's. These CTBF's may be prepared by condensation of primary and 
secondary amine pharmacophores with carbonyl derivatives (X = halide, OR, SR; Y = 
S,Q; Z = S,0). Numerous methods are known to those practiced in the art (e.g., March, 
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et al. f supra, p. 418; and Greene, et al., supra, pp. 315-348). R 8 may be H, or a 
straight chain OF*branched aikyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to 
five R 9 groups (R 9 is aikyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (aikyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 



CTBF 



*YR 8 

to 

Z = O.S; Y = S,0 



CTBF 



X yr8 



or 



CTBF— Part A 



-NH 



+ X^YR 8 



CTBP— Part A 



CTBF — Part B 



Z = O.S; Y = S..O 



Ayr 8 



CTBF— Part B 



(f). Alternatively, the library may be assembled as guanidine BLFGs. These 
CTBFs may be prepared by condensation of primary and secondary amine 
pharmacophores with carbonyl derivatives (X = halide, OS02R, OR, SR) [Roskarnp, 
et al. Tetrahedron, 53:6697-6705 (1997)]. R 8 may be H, or a straight chain or 
branched aikyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is aikyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (aikyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 

CTBM J NH 2 + X-^NR«R9 | CTBM K m^MRBrS 



H 

or 
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[ CTBM — Fragment A — N 



-NH 



*JR 10 
^NR a R 9 



X 



R 10 



CTBM— Fragment B "] 



CTBM — Fragment A | — tsi^"NR g R 9 



CTBM— Fragment B 



10 



15 



20 



3. Epoxides as the linkage functional groups and the corresponding blocked linkage 
functional groups. 

(a). The linkage functional group is the epoxide group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Epoxide CTBFs are represented with four parts (A through D) of the CTBF attached to 
the epoxide group as shown below. Each fragment may be H, or functionality 
whereby a carbon atom is directly attached to the epoxide functionality. 



CTBF— Part A 



CTBF — Part B 



CTBF— Part C 



CTBF — Part D 



25 



(b). Alternatively, the library may be assembled as 1,2-amino alcohol BLFGs. 
These CTBFs may be prepared by coupling epoxide LFGs with primary or secondary 
amines employing known methods to those practiced in the art (March, et al., supra, p. 
416).. R 8 or R 9 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
or R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF— Part 



CTBF— Part C 



CTBF — Part 



CTBF— Part D 



CTBF— Part 



CTBF— Part C 



CTBF— Part D 



(c). Alternatively, the library may be assembled as, 2-hydroxy thioether 
30 BLFGs. These CTBFs may be prepared by coupling epoxide LFGs with thiols 
employing known methods to those practiced in the art (Wardell, in Patai The 
Chemistry of the Thiol Group, pt. 1; Wiley, New York, 1974, pp. 246-251). R 8 may 
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be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from I to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). The 2-hydroxy thioether BLFGs may also be readily oxidized 
to more water soluble 2-hydroxy sulfoxide or sulfone BLFGs (March, et al., supra, pp. 
1201-1203). 



CTBF — Part A k^/\^H CTBF— Part C | [ CTBF — Part A f y? ^ \ CTBF— Part C | 

[gZr^H CTBF-P, rtD | + HSR * ~* I CTBF Part B 



CTBF— Part D 



(d). Alternatively, the library may be assembled as,2-hydroxy ether BLFGs. 
These CTBFs may be prepared by coupling epoxide LFGs with alcohols employing 
known methods to those practiced in the art(March, et al., supra, p. 391). R 8 may be 
H, or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended 
with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also 
be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



CTBF— Part 



1 CTBF — Part A K ^/V^ l CTBF— -Part C | [ CTBF— Part A kj^ CTBF — Part C 

IZTN ] CTBP-Pa.,0 | + H ° R " ~~ | CTBF-P. rtB K ^ffvf 



CTBF— Part D 



4. Carboxylic acids as the linkage functional groups and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the carboxylic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Carboxylic acid CTBFs are represented as follows. 
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CTBF — COOH 



(b). Alternatively, the library may be assembled as amide BLFGs. These 
CTBFs may be prepared by coupling carboxylic acid LFGs (X = OH), or derivatives 
of carboxylic acid LFGs (X = OR, SR, halide) with primary or secondary amines. 
5 Numerous known methods are available for coupling carboxylic acids and carboxylic 
acid derivatives with amines to those practiced in the art [e.g., March, et al., supra, pp. 
417-425; and Fields et al. Int. J. Peptide Protein Res. 35:181-187 (1990)]. R 8 and R 9 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may 

10 

also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N- 
j 5 acylamino, alkoxy, hydroxy, mercapto, phosphono). 

{ + HN — R 8 I CTBF f-^ o r 



25 



30 



CTBF 

R 9 



(c). Alternatively, amine BLFGs could be prepared by reduction of amide 
BLFGs prepared as described above in step 4b from the corresponding carboxylic acid 
20 LFGs. A number of reducing agents could be employed that are known to those 
practiced in the art (e.g., March, et al., supra, pp. 1212-1213). R 8 or R 9 may be H, or 
a straight chain or branched alkyl group of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also be appended 
with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



i U Reducing Agent , , ^ r8; 

— V._«_o ► CTBF — CH 2 



CTBF x „ q 
NR*R 9 



(d). Alternatively, the library may be assembled as ester BLFGs. 

(i). These CTBFs may be prepared by coupling carboxylic acid LFGs 
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(X = OH), or derivatives of carboxylic acid LFGs (X = OR, SR, halide) with alcohols. 
Carboxylic acids '^hd carboxylic acid derivatives may be coupled with alcohols 
employing numerous known methods to those practiced in the art [e.g., (March, et al., 
supra, pp. 392-398) and (e.g., Greene, et al., supra, pp. 227-228), and (Hughes et 
5 aI.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley & Sons, New 
York, 1992, vol. 42, pp. 343-347)]. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono)." R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, //-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



with an aryl, heteroaryl or alkyl group substituted with a leaving group X, where X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyI-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality on R 8 , or it could be attached to 
an aliphatic group on R 8 . When X is substituted upon aromatic and heteroaromatic 
functionality, an S^Ar reaction or a palladium-mediated, copper-mediated or related 
transtion metal mediated coupling reaction would be performed. Where X is 
substituted upon alkyl functionality, an Sn2 or SnI reaction would be performed. 
25 Methods for these transformations are known to those practiced in the art [e.g., 
(March, et al., supra, pp. 398-399) and (e.g., Greene, et al., supra, pp. 228-229)]. R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 



15 




(ii). These CTBFs may be prepared by reacting carboxylic acid LFGs 
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mercapto, phosphono). 

"— ^ + X-R 8 | CTBF {— ^ 



CTBF 



OH 



'OR 8 



(e). Alternatively the library may be assembled as thioester BLFGs. 

(i). These CTBFs may be prepared by condensation of carboxylic acid 
LFGs (X = OH), or carboxylic acid derivatives (X = OR, SR, halide) and thiols. 
Carboxylic acids and carboxylic acid derivatives may be coupled with thiols 
employing known methods to those practiced in the art (e.g., March, et al., supra, p. 
409). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, //-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono). 

"— £ + HS-R 8 >~ I CTBF f— C a 

-I N - 1 1 SR 8 



CTBF 



5. Sulfonic acids as the linkage functional groups and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the sulfonic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
25 practiced in the art. 

Sulfonic acid CTBFs are represented as follows. 



CTBF — SO a H 



(b). Alternatively, the library may be assembled as sulfonamide BLFGs. 
These CTBFs may be prepared by reacting sulfonic acid LFGs (X = OH) or 
derivatives of sulfonic acids (X = halide, alkoxyl, mercaptyl) with amines (e.g., March, 
et al., supra, p. 499, Greene, et al. supra, pp. 379-385). R 8 and R 9 may be H, or a 
straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
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to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended 



amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
5 heteroaryl, carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



6. The hydroxyl group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the hydroxyl group. 

These CTBFs may be available commercially, or may be prepared by a variety 
of known methods to those practiced in the art. 
Alcohol CTBFs are represented as follows. 



(b). Alternatively the library may be assembled as ether BLFGs. 

(i). The ether may be prepared by reaction of an alcohol LFG with an 
aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an SnAr reaction or a palladium-mediated, copper-mediated or related transtion metal 
mediated coupling reaction would be performed [(e.g., March, et al., supra, pp. 654- 
655) and Hartwig et al., supra, pp. 8005-8008). Where X is substituted upon alkyl 
functionality, an Sn2 or SnI reaction would be performed. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 



with up to five ISF 0 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
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or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



10 



15 



25 



CTBF —OH + X— R 8 CTBF — O-R 8 



(ii). The ether may be prepared by the Mitsunobu reaction between 
5 alcohol LFGs and another alcohol where the second alcohol is acidic (pKa < 12), for 
example, phenols and oximes (Hughes et al.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636). R 8 may be H, 
or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
The key requirement on R 8 is that the alcohol has a pKa < 12. 

CTBF I— OH + HO— R 8 1 



CTBF 



-O-R 8 



(iii). The ether may also be prepared by the Mitsunobu reaction 
between alcohol LFGs and another alcohol, where the alcohol LFGs are acidic (pK a < 
12), for example, phenols or oximes (Hughes et al.(Paquette, Series Editor in Chief), 
20 Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636). R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
The key requirement on the pharmacophore substituted alcohol is that it has a pK a < 
12. 



CTBF —OH + HO— R 8 CTBF — O-R 



(c). Alternatively the library may be assembled as ester (Z = O) or thioester (Z 
• 10 = S) BLFGs. Alcohol LFGs may be coupled with carboxylic acids (Z = O, X = OH), 
carboxylic acid derivatives (Z = O, X = OR, SR, halide), or the corresponding thio- 
substituled derivatives (Z = S, X = OH, OR, SR halide). Carboxylic acids and 
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carboxylic acid derivatives may be coupled with alcohols employing a variety of 
known methods to those practiced in the art [e.g., (March, et al. f supra, pp. 392-398) 
and (e.g., Greene, et ah, supra, pp. 227-228), and (Hughes et al.(Paquette, Series 
Editor in Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, 

5 pp. 343-347)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 

} substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



(d). Alternatively the library may be assembled from thioether BLFGs. The 
thioethers may be prepared by the Mitsunobu reaction between alcohol LFGs and a 
thiol (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley 
& Sons, New York, 1992, vol. 42, pp. 365-366, 381-382). R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono). The 
thioether BLFGs may also be readily oxidized to more water soluble sulfoxide or 
sulfone BLFGs. 



(e). Alternatively the library may be assembled as carbamate (Z = O) or 
thiocarbamate (Z = S) BLFGs. 

(i). These CTBFs may be prepared by condensation of alcohol LFGs 
and isocyanates or isothiocyanates. The direct coupling of isocyanaies and 




+ 




z = o,s 



CTBF —OH + HS — R 8 



CTBF — S— R 8 



WO 99/4931 4 PCT/US99/06734 

isothiocyanates with alcohols is straighforward and obvious to those practiced in the 
art (March, et al., supra, pp. 891-892). R 8 may be H, or a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
5 alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

Z 

CTBF f— OH + Z=C=N — R 8 1 CTBF ho^N"" 8 



10 



15 



20 



H 

z=o, s z = o,s 

(ii). These CTBFs may be prepared by condensation of alcohol LFGs 
and carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



+ X^^ R8 | CTBF h yJ^R 



8 



CTBF I OH 

R 9 *9 

z = o,s z = o,s 

25 (iii). These CTBFs may be prepared in a two step process (e.g., March, 

et al., supra, p. 418; and Greene, et al., supra, pp. 315-348). In the first step, the 
alcohol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or related 
derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or other 
suitable leaving groups. In the second step an amine is added to displace the leaving 
30 group Y. R 8 and R 9 may be H, or a straight chain or branched alkyl groups of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, 
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heteroaryl, carboxy ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 and R 9 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

» H + x^Y ^ | CTBF 



(1) 



CTBF 



(2) 



CTBF 



■ 

z = o,s 



Y 

z = o, S 



R 9 



CTBF 



z = o,s 

(f). Alternatively the library may be assembled with thioester BLFGs. The 
thioesters may be prepared by the Mitsunobu reaction between alcohol LFGs and a 
thiol acid (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John 
Wiley & Sons, New York, 1992, vol. 42, pp. 343-347). R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —OH + HS 



R 8 



CTBF 



(g). Alternatively the library may be assembled with amide, thioamide, urea, 
thiourea, sulfonamide, carbamate, thiocarbamate (Z - SO2, CO, CS, CO2, COS, CSO, 
CONR 11 , CSNR 11 .) BLFGs. Substitution of the hydroxyl LFGs are accomplished 
using the Mitsunobu reaction [ e.g., (Hughes et al.(Paquette, Series Editor in Chief), 
Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636) and 
(Fukuyama, et ah, Tetrahedron Letters, 38: 5831-5834 (1997) and references therein)]. 
An aliphatic hydroxyl group is prefered. R 8 and R 9 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up 
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to five R 10 groups (R 10 is alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). . R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
halide, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF —OH 



Ft 9 



CTBF 



.Z-R 8 



•1 



2 = S0 2 , CO, CS, C0 2 . COS, CSO, CONR 10 , CSNR 10 
(h). Alternatively, the library man be assembled with amine BLFGs. The 
amine BLFGs will be prepared from the BLFGs described in 6g. The acyl or sulfonyl 
functionality may be removed by methods known to those practiced in the art (such as, 
acidic or basic hydrolysis, or dissolving metal reactions for sulfonamides Greene et al., 
supra, 349-357 and 379-385). Milder conditions may be applied to more specialized 
groups, e.g., trifluoroacetamides may be cleaved by mild basic hydrolysis (Greene et 
al., supra, 353-354), and nitrosubstituted benzenesulfonamides may be cleaved by 
thiolate addition) to provide secondary amines (Fukuyama, et al., Tetrahedron Letters, 
38: 5831-5834 (1997) and references therein). Alternatively for some derivatives, 
reduction will provide tertiary amines (e.g., March, et al., supra, pp. 1212-1213). 



CTBF —I 



Z— R 8 



R 9 



CTBF 



Z-R 8 



or 

reduction 



CTBF 



H 
R 9 



CTBF 



Z = S0 2 , CO. CS, C0 2 , COS, CSO, CONR 10 , CSNR 10 
7. The thiol group as. the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the thiol group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Thiol CTBFs are represented as follows. 



CTBM — SH 
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(b). Alternatively the library may be assembled as thioether BLFGs. 

($! The thioether may be prepared by reaction of a thiol LFG with an 
aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
5 aryl, e.g. CH,, CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an StfAr reaction or a palladium-mediated, copper-mediated or related transtion metal 
mediated coupling reaction would be performed. Where X is substituted upon alkyl 
10 functionality, an Sn2 or S^l r&action would be performed. These methods are known 
to those practiced in the art [(March et a!., supra, pp. 407-409) and (Peach in Patai The 
Chemistry of the Thiol Group, pt. 1, John Wiley & Sons, New York, 1974, pp. 721- 
735)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
20 hydroxy, mercapto, phosphono). The thioether BLFGs may also b oxidized to more 
water soluble sulfoxide and sulfone BLFGs (March, et al., supra, pp. 1201-1203). 



25 



CTBF \ — SH + X— R 8 | CTBF ] — S-R 8 

(ii). The thioether may also be prepared by the Mitsunobu reaction 
between alcohol LFGs and a thiol (e.g., (Hughes et al. (Paquette, Series Editor in 
Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335- 
636).. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
1() ester, carboxamide, amino, 7V-acylamino, alkoxy, mercapto, phosphono, sulphono). 
The thioether BLFGs may also b oxidized to more water soluble sulfoxide and 
sulfone BLFGs (March, et al., supra, pp. 1201-1203). 
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CTBF |— SH + HO— R 8 



CTBF — S— R 8 



(c). Alternatively the library may be assembled with BLFGs whereby the thiol 
LFGs are acylated (March et al„ supra, p. 409). Thiol LFGs may be coupled with 
carboxylic acids (Z = O, X = H), carboxylic acid derivatives (Z = O, X = OR, SR, 
5 halide), or the corresponding thio-substituted derivatives (Z = S, X = H, OR, SR 
halide). Carboxylic acids and carboxylic acid derivatives may be coupled with thiols 
employing known methods to those practiced in the art. R* may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R» may also be appended with up to five 
R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R« may also be an aryl 
or heteroaryl group that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, mercapto, 
j ^ phosphono). 



+ x^R 8 I CTBF L-S 



CTBF | — SH 

2=O.S 

(d). Alternatively the library may be assembled as thiocarbamate (Z = O) or 

20 thiocarbamate (Z = S) BLFGs. 

(i). These CTBFs may be prepared by condensation of thiol LFGs and 
isocyanates or isothiocyanates. The direct coupling of isocyanates and isothiocyanates 
with thiols is straighforward and obvious to those practiced in the art (Greene et al. 
supra, p. 301). R» may be H, or a straight chain or branched alkyl group of length 1 to 

25 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R» groups (R*> is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R8 may also be an aryl or heteroaryl group that is optionally substituted 

^ (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino W-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 
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R 8 



CTBF j rrgld + 2= C =N-R 8 | CTBF h fi^N' 

H 

z=o,s z = o,s 

(ii). These CTBFs may be prepared by condensation of thiol LFGs and 
carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



I — SH 

I 



X^|SJ' R8 1 CTBF K Q X M ^1 



R 8 



z = o,s z = o,s 

(iii). These CTBFs may be prepared in a two step process. In the first 
step, the thiol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or 
related derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or 
other suitable leaving groups (Greene et al. supra, pp. 299-301). In the second step an 
amine is added to displace the leaving group Y. R 8 and R 9 may be H, or a straight 
chain or branched alkyl groups of length 1 to . 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up 
to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also 
be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 
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(1) 



CTBF 



— SH + X"^" 



Y 

Z = O.S 



CTBF 



(2) 



CTBF 



X 



Y + R<k NH 
R 9 



CTBF 



Z = 0,S " 2 = O.S 

(e). Alternatively the library may be assembled as disulfide BLFGs. The 
disulfides may be prepared by reacting the thiol LFGs with thiols (X = H) or activated 
thiols (X = mercaptyl, halide, sulfonyl) employing known methods to those practiced 
in the art (e.g., Greene et al., supra, pp. 302-303). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide. amino. .V- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an ary! 
or heteroaryl group that is optionally substituted (halide, alkyl, aryl. halide, heterean 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto. 
phosphono). 



CTBF — SH + XS-R 8 



CTBF — S — S-R 8 



5. The isocyanate or isothiocyanate group as the linkage functional group and uw 
corresponding blocked linkage functional groups 

(a). The linkage functional group is the isocyanate (Z = O) or isothicc> ^na:e 
(Z = S) group. These CTBFs may be available commercially, or may be prepared by a 
variety of known methods to those practiced in the art. 

Isocyanate (Z = O) or isothiocyanate (Z = S) CTBFs are represented as follow >. 



CTBF — N=C=Z 



z = o, S 

(b). Alternatively the library may be assembled with urea or thiourea BLFGs 
by reaction of the isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with amines 
(March et al., supra, p. 903). The direct coupling of isocyanates with aniines :s 
straighforward and known to those practiced in the art. R 8 and R 9 may be H. or a 
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straight chain or branched alkyl groups of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended 
with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF 1— N=C=Z + HN-R 8 I CTBF f- N -^ N 

1 d9 1 ' H I 



ft- H • 9 

z = o, s „■ 

(c). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with thiols (Green et al., supra, p. 
301). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



CTBF 



-N=C=Z + HS-R 8 I CTBF f- N ^s^ R 

L 1 H 



JL. 



,8 



z = o, s 

(d). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with alcohols (March et al., supra, 
pp. 891-892). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 
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20 



25 



| CTBF [— N=C=Z + HO-R 8 



CTBF 



x. 



H 



Z = 0, S 

9. The halide or related sulfonate group as the linkage functional group and the 
5 corresponding blocked linkage functional groups 

(a). The linkage functional group is the halide leaving group or a similarly 
reactive sulfonate (X= OS0 2 R where R is substituted or unsubstituted alkyl or aryl, 
e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ) leaving group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
I o practiced in the art. 

The halide and sulfonate (X= OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CHL,, CF 3 , phenyl-CH^ and phenyl-N0 2 ) CTBFs may be represented 
as follows. 



CTBF — X 



15 

(b). Alternatively the library may be assembled with amine BLFGs. 

(i). The amine BLFGs may be prepared by substitution of the leaving 
group (X = halide or OS0 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. 
CH^, CF 3 , phenyl-CH 3 and phenyl-N0 2 ) with amines The leaving group may be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could 
be attached to aliphatic functionality. A number of methods are known to those 
practiced in the art [e.g, March, et al. supra, pp. 656-657 and 411-413; Wagaw et al., 
7. Am. Chem. Soc, 119: 8451-8458 (1997) and references therein; and Ahman et al. 
Tetrahedron Letters, 38: 6363-6366 (1997)]. When X is substituted upon aromatic 
and heteroaromatic functionality, an StfAr reaction may be performed. If the leaving 
group X is substituted upon aromatic, heteroaromatic or alkenyl functionality, a 
palladium-mediated (or related transition metal mediated) amine coupling reaction 
may be performed . Where X is substituted upon alkyl functionality, an S^2 or SnI 
reaction may be performed. R 8 or R 9 may be H, or a straight chain or branched alkyl 
3Q group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 or R 9 may also be appended with up to five R 10 groups (R 10 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl 
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group that is optionally substituted (alky], aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — X + NHR 8 R 9 



CTBF — NR"R 9 



(ii). The amine BLFGs may also be prepared by a two step process. In 
5 the first step, substitution of the leaving group X with an amide, thioamide, urea, 
thiourea sulfonamide, carbamate, thiocarbamate (Z = S0 2 , CO, CS, C0 2 , COS, CSO, 
CONR". CSNR") is performed (see 0 - 0 below). X may be a halide or OS0 2 R 
where R is substituted or unsubstituted alkyl or aryl, e.g. CH,, CF 3 , phenyl-CH 3 and 
^ phenyl-NO,. The leaving group may be attached to aromatic or heteroaromatic 
functionality, alkenyl functionality, or it may be attached to aliphatic functionality. . 
When X is substituted upon aromatic or heteroaromatic functionality, an S N Ar 
reaction may be performed. When X is substituted upon aromatic, heteroaromatic, 
allylic or alkenyl a palladium-mediated (or related transtion metal mediated) amine 
coupling reaction may be performed. Where X is substituted upon alkyl functionality, 
an S N 2 or S N 1 reaction may be performed. Methods are known to those practiced in 
the art (March et al., supra, pp.425-427). In the second step, the acyl or sulfonyl 
functionality may be removed by methods known to those practiced in the art (such as, 
acidic or basic hydrolysis, or dissolving metal reactions for sulfonamides (Greene et 
al., supra, 349-357 and 379-385). Milder conditions may be applied to more 
specialized groups, e.g., trifluoroacetamides may be cleaved by mild basic hydrolysis 
(Greene et al., supra, 353-354), and nitrosubstituted benzenesulfonamides may be 
cleaved by thiolate addition) to provide secondary amines (Fukuyama, et al., 
Tetrahedron Utters, 38: 5831-5834 (1997) and references therein). Alternatively for 
some derivatives, reduction will provide tertiary amines (e.g., March, et al, supra, pp. 
1212-1213). R8 to R" may be H, or a straight chain or branched alkyl group of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R8 to R" may also be appended with up to five R" groups (R" is alkyl, aryl, halide, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R8 to R" may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl. aryl, halide. heteroaryl, carboxy ester, 
carboxamide, amino. N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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(D CTBF | — X -f l)T^R 8 | CTBF h H 



(2) 



Z = S0 2 , CO. CS, C0 2 , COS, CSO, CONR 10 , CSNR 10 
| CTBF f -r^ ** | CTBF | — 



CTBF 



J-N^""' reduction > [ctbjQ-i^ 1 

R 9 R° 



R 9 

R 11 



(c). Alternatively the library may be assembled with ether BLFGs. The ether 
jq BLFGs may be prepared by displacement of leaving group LFGs with an alcohol. X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could 
be attached to an aliphatic functionality. When X is substituted upon aromatic or 

j ^ heteroaromatic functionality an SnAt reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed performed [(e.g., March, et aL, supra, pp. 654-655) and Hartwig et al. 
Tetrahedron Lett., 38: pp. 8005-8008). Where X is substituted upon alkyl 

20 functionality, an Sn2 or SnI reaction may be performed (e.g., March, et al., supra, pp. 
386-387 and 388-389). R 8 may be H, or a straight chain or branched alkyl group of 
length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the 
chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 

25 mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 

CTBF — X + HO— R 8 I CTBF \— O-R 8 



30 (d). Alternatively the library may be assembled with thioether BLFGs. The 

thioether BLFGs may be prepared by displacement of leaving group LFGs with an 
thiol. X may be a halide or a sulfonate group (OS0 2 R where R is substilutcd or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH, and phenyl-N0 2 ). The halide 
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could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an SjsyAr reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an Sn2 or S^l reaction 
may be performed. These methods are known to those practiced in the art [(March et 
al., supra, 407-409) and (Peach in Patai The Chemistry of the Thiol Group, pt. 1, John 
Wiley & Sons, New York, 1974, pp. 721-735)]. R 8 may be H, or a straight chain or 
branched alkyl group of lefigth 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). The 
thioether may also be oxidized to the more water soluble sulfoxide or sulfone BLFGs 
(March et al., supra, pp. 1201-1203). 

X + HS — R 8 1 



CTBF 



CTBF S-R 8 



(e). Alternatively the library may be assembled with ester BLFGs. The ester 
BLFGs may be prepared by displacement of leaving group LFGs with a carboxylate. 
X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide 
could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an StfAr reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an Sn2 or S^l reaction 
may be performed. . Methods for these transformations are known to those practiced in 
the art [e.g., (March, et al., supra, pp. 398-399) and (e.g., Greene, et al., supra, pp. 
228-229)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 



- 66 - 



WO 99/49314 



PCT/US99/06734 



10 



15 



30 



10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 

—X + M + CT^R 8 I CTBF f— O 



CTBF 



(Q. Alternatively the library may be assembled with thiolester BLFGs. The 
ester BLFGs may be prepared by displacement of leaving group LFGs with a 
thiolacid. X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH3, CF 3 , phenyl-CH 3 and phenyl-NO,). The halide 
could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an S^Ar reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an S^2 or SnI reaction 
2Q may be performed. R 8 may be H, or a straight chain or branched alkyl group of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
25 substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono). 



,^o8 w I I c/ 



1 CTBF M + SH^R 8 [ CTBF h S 



(g). Alternatively the library may be assembled with urea or thiourea BLFGs. 
These BLFGs may be prepared by displacement of leaving group LFGs with a primary 
or secondary ureas (Z = O) or thioureas (Z = S). X may be a halide or a sulfonate 
group (OSQ 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. CH„ CF 



3' 
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phenyl-CF 3 and phenyl-N0 2 ). The halide could be attached to aromatic or 
heteroaromatic- functionality, alkenyl functionality, or it could be attached to an 
aliphatic functionality. When X is substituted upon aromatic or heteroaromatic 
functionality an S^Ar reaction may be performed. When X is substituted on aromatic, 
heteroaromatic, allylic or alkenyl functionality a palladium-mediated, Cu mediated, or 
related transtion metal mediated coupling reaction may be performed. Where X is 
substituted upon alkyl functionality, an Sjsj2 or S^l reaction may be performed. R 8 to 
R 10 may be H, or a straight chain or branched alkyl group of length 1 to 10, which 
may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 10 
may also be appended with" up to five R 11 groups (R 11 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). . R 8 to R 10 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



H I , , Vl 



CTBF 



NR®R 9 



CTBF —X + X N^^NR 8 R 9 

R 10 — Rio 

z = o,s 

(h). Alternatively the library may be assembled with sulfonamide BLFGs. 
The sulfonamide BLFGs may be prepared by displacement of leaving group LFGs 
with a sulfonamide. X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The halide could be attached to aromatic or heteroaromatic functionality, alkenyl 
functionality, or it could be attached to an aliphatic functionality. When X is 
substituted upon aromatic or heteroaromatic functionality an SnAr reaction may be 
performed. When X is substituted on aromatic, heteroaromatic, allylic or alkenyl 
functionality a palladium-mediated, Cu mediated, or related transtion metal mediated 
coupling reaction may be performed. Where X is substituted upon alkyl functionality, 
an Sn2 or S^l reaction may be performed.. These methods are known to those 
practiced in the art [(March et al., supra, pp.425-427) and (Fukuyama et al., 
Tetrahedron Letters* 38:5831-5834 (1997)]. R 8 may be a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
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P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



R 9 



CTBF | — 



R 9 

(i). Alternatively the library may be assembled with carbamate, thiocarbamate, 
10 or related BLFGs. These BLFGs may be prepared by displacement of leaving group 
with the corresponding carbamate, thiocarbamate, or related BLFGs. X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH3, CF 3 , phenyl-CH and phenyl-N0 2 ). The halide could be attached to 
aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached 
15 to an aliphatic functionality. When X is substituted upon aromatic or heteroaromatic 
functionality an S^Ar reaction may be performed. When X is substituted on aromatic, 
heteroaromatic, allylic or alkenyl functionality a palladium-mediated, Cu mediated, or 
related transtion metal mediated coupling reaction may be performed. Where X is 
substituted upon alkyl functionality, an Sn2 or S N 1 reaction may be performed. R 8 to 
20 R 9 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 9 may 
also be appended with up to five R 11 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). . R 8 to R 10 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono. 



CTBF 



"X + ,,% N^YR» ~ — 7 " YR8 
R 9 



CTBF 



30 Z = 0,S;Y = 0,S 

(j). Alternatively the library may be assembled by BLFGs where the leaving 
group is replaced with carbon-based functionality. 
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(i). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or'ffrisQbstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The metal (M) may be any metal, but the prefered metals are BL n , SnL n , ZnL n , ZrL n1 
CuL 0 , SiL n , Ti L n , Al L 0> and L n where L corresponds to metal ligands (e.g., halide, 
S alkoxide, alkyl, aryl, alkenyl, heteroaryl, phosphine sulfide, amido) many of which are 
acceptable and are known to those practiced in the art. The halide could be attached to 
aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached 
to an aliphatic functionality. Addition can occur directly, or can be catalyzed by 
transition metals. Numerous methods are known to those practiced in the art [e.g. 
10 (Hegedus, L. S. (1994) in Transition Metals in the Synthesis of Complex Organic 
Molecules pp. 65-129, University Science Books, Mill Valley), (Knochel et al. 
Chemical Reviews 93:2117-2188 (1993)), (Miyaura et aL. Chemical Reviews 95: 
2457-2483 (1995)), and (Farina et al.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1997, vol. 50, pp. 1-653)]. R 8 may be H, 
15 or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

X + M— R 8 



20 



CTBF 



CTBF 



R 



(ii). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 

25 The BLFGs may be prepared in a two step process. In the first step, an 
organometallic reagent is prepared where the metal (M) may be any metal, but the 
prefered metals are BL n , SnL M , ZnL n , ZrL ti , CuL n , SiL n , Ti L n , Al L n , and L 0 where L 
corresponds to metal ligands (e.g., halide, alkoxide, alkyl, aryl, alkenyl, heteroaryl, 
phosphine sulfide, amido) many of which are acceptable and are known to those 

.10 practiced in the art. In the second step, carbon-carbon bond formation is performed to 
generate the BLFGs. Numerous methods for this two step process are known to those 
practiced in the art [e.g. (Hegedus, L. S. (1994) in Transition Metals in the Synthesis of 
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Complex Organic Molecules pp. 65-129, University Science Books, Mill Valley), 
(Knochel et al. Chemical Reviews 93:2117-2188 (1993)), (Miyaura et al.. Chemical 
Reviews 95: 2457-2483 (1995)), and (Farina et al.(Paquette, Series Editor in Chief), 
Organic Reactions, John Wiley & Sons, New York, 1997, vol. 50, pp. 1-653)]. R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

(1) I CTBF I— X ^ I CTBF I — M 



15 



(2) 



CTBF — M + 



R 8 X 



CTBF R 



20 



25 



(iii). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A palladium-mediated Heck reaction or related transtion metal 
mediated coupling reaction would be performed [e.g., (de Meijere, A. et al., Angew. 
Chem. Int. Ed, Engl., 33:2379-2411 (1994) and references therein) and Heck, 
Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. For the 
alkene, R 8 - R 9 may be straight chain or branched alkyl groups of length 1 to 10, which 
may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 - R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 - R 9 may also be aryl or heteroaryl groups that are optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 

P10 R 9 

CTBF —X + H^^-rS ^ I CTBF L ^L- D io 
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CTBF —X + 



CTBF —X + 



R" 



CTBF \ — X + H^-ss^H 



or 



CTBF 



^f^H or | CTBF f^^ H 
R 8 fc 



or 



CTBF 



H 

R 8 



or 



or 



CTBF 



^^R 8 or 



CTBF 



CTBF 



H 



R 8 




(iv). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3> CF 3 , phenyl-CHj and phenyl-N0 2 ). 
The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A copper-mediated, palladium-mediated reaction, or related 
transtion metal mediated coupling reaction would be performed. Numerous methods 
are known to those practiced in the art [e.g., (March et al., supra, p. 481), 
(Sonagashira, K. in Comprehensive Organic Synthesis; Trost, B. M., Fleming, L, Eds.; 
Pergamon Press: New York, 1991, vol. 3 pp. 521-549), and (Rossi et al., Org. Prep. 
Proced. Int. 27:129-160 (1995)]. For the alkyne, R 8 may be H, straight chain or 
branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 
groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be a aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

=== — R 8 



CTBF —X + — R 8 



CTBF 



10. The alkenyl group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the alkenyl group. These CTBFs may be 
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available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

CTBFs may be represented as follows, where as many as four CTBF fragments 
(A through D) may be present on the alkene functionality. 



CTBF Part A I y 

CTBF PartB \ \ 



CTBF Part C 

CTBF Part D 



(b). Alternatively the library may be assembled with modified alkene BLFGs. 

(i) . The alkene BLFGs may be prepared by performing a transition 
metal-mediated (typically Pd) catalyzed Heck reaction or related reactions on 
trisubstituted, disubstituted or the vinyl functionalized CTBFs [e.g., (de Meijere, A. et 
al., Angew. Chem. Int. Ed. EngL, 33:2379-2411 (1994) and references therein) and 

15 Heck, Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. R 8 
may be a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
20 R8 may also be a aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH^ CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 

1 CTBF Part A CTBF— Part C | J CTBF part A 

25 I CTBF— PartB f H + | CTBF— Part B 

(ii) . The alkene BLFGs may be prepared by performing a transition 
metal (typically Ru or Mo) catalyzed olefin metathesis reaction [e.g., (Grubbs et al. 
Accounts of Chemical Research 28:446-452 (1995)) and (Schuster et al. Angewandte 
Chemie-International Edition in English 36:2037-2056 (1997))]. R 8 - R 14 may be 

10 

straight chain or branched alkyl groups of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 - R 14 may also be appended with 
up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
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amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 - R 14 may 
also be aryl of Tieterbaryl groups that are optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



CTBF Part 



A~"Lr» tfo R" | CTBF PartA {__J *" | CTBF PartA L ft 1 ' 

CTBF PartBf^R* R"* 1 * I CTBF PartB f R" °' | CTBF Parts f R" 



11. Methods to Covalently Bond TBFs using the LFG with a Bifunctional Linker 
(BFL) to Produce Candidate Crosslinked Target Binding Molecules (CXL-TBFs). 



10 



TBF m 


—LFG + 


TBF n 


—LFG + 


BFL 



CXL-TBF 



Upon identification of the TBFs, crosslinking is accomplished with a BFL. All 
of the chemistry described for preparation of the BLFGs could be employed in the 
crosslinking step. One method would be to crosslink TBFs that have the same LFGs 
and employing the same crosslinking chemistry. A second method would be to 
15 crosslink TBFs that have the same LFGs, but employing different crosslinking 
chemistry. A third method would be to crosslink TBFs that have different LFGs 
which is most but not all cases would require different corsslinking chemistry. Each 
of these strategies use known methods to those practiced in the art. Examples of the 
three methods are provided below. 
20 Method 1. For example, two TBFs with aldehyde LFGs could be crosslinked 

employing a BFL that incoiporates two O-substituted hydroxylamines. 



25 



Method 2. For example, two TBFs with aldehyde LFGs could be crosslinked 
employing a BFL that incorporates one O-substituted hydroxylamine and one acyl 
hydrazide. 



[jFbf^-cko + QbfT}_cho ^V ^g ^"^ E™EZH=V4^ 

Method 3. For example, one TBF with an aldehyde LFG and one TBF with an 
amine LFG could be crosslinked employing a BFL that incorporates one O-substituted 
™ hydroxylamine and one carboxylic acid. 



A hypothetical example for the preparation of heterolinkers would be to 
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employ amino acids as heterolinkers. As shown below, the amino acid heterolinker 
serves to link TBFs with carboxylic acid LFGs to TBFs with amine LFGs. The amine 
LFG could be a primary amine or a secondary amine (not shown). Many methods that 
are known to those practiced in the art could be used to prepare the CXL-TBFs using 
5 amino acid heterolinkers. One of the methods is described below. 

[^>/ H + [jB^]-NH a + 

For this sample method, the amino acid BFL is protected as the N-tert- 
butoxycarbonyl (Boc) derivative. Many TV-Boc amino acids are commercially 
10 available, e.g., Novabiochem (San Diego, CA) and Neosystems (Strosbourg, France). 
//-Boc protected amino acids can also be prepared by known methods to those 
practiced in the art (Bodansky et al., The Practice of Peptide Synthesis, Springer- 
Verlag, Berlin, 1984, 18-20). 

The sample method is illustrated in Scheme 1. Sample experimental 
1 5 procedures are provided below. 

Step 1 (Desai, et aU Tetrahedron Letters. 34, 7685-7688. (1993)) 
To a suspension of polymer-bound carbodimide (1.5 mmol) in chloroform (10 
mL) is added the 7V-Boc protected amino acid 2 (0.55 mmol) and the amine-substituted 
TBF 1 (0.50 mmol). After the reaction mixture is shaken overnight at room 
20 temperature, the mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) 
and the combined filtrate is evaporated in vacuo to yield 3. 

Step 2 

To compound 3 (0.5 mmol) is added 5 mL of a solution of 4.0 M hydrochloric acid in 
dioxane (Aldrich, Milwaukee, WI). The solution is stirred for one hour at room 

IS 

temperature and then evaporated to remove the solvent and excess hydrochloric acid. 
The product is diluted with 5 mL of methanol and concentration is repeated to provide 
4. 

Step 3 (Desah et al.. Tetrahedron Letters. 34, 7685-7688, (1993)) 
To a suspension of polymer-bound carbodiimide (1.5 mmol) in chloroform (10 

•0 

mL)is added intermediate 4 (0.50 mmol) and the carboxylic acid-substituted TBF 5 
(0.55 mmol). After the reaction mixture is shaken overnight at room temperature the 
mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) and the 
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combined filtrate is evaporated in vacuo to yield product 6, which is the desired CXL- 
TBF. 

Scheme 1 

Step 1 : 

5 | TBF n h-NH, + BocHN | Unker f -^ O^-" ^ BocHN _J Unker lEs I 

1 2 3 



Step 2: 

) H m H 

, . M-\ TBF n I 4MHCMndioxane , ^- J TBF 1 I 

BocHN — | Unker {-^1 =J , CI H3N— | Unker ^ ' " I 



Step 3: 

0 O H O 

4 

5 6 

EXPERIMENTAL 

Unless otherwise noted, materials were obtained from commercial suppliers 
and used without further purifications. Aldehydes were purchased from Aldrich 
Chemical Company, Inc. (Milwaukee, WI). Anhydrous dimethylsulfoxide (DMSO) 
and acetic acid were purchased from Fischer (Pittsburg, PA). Soluble CD4 (sCD4) 
was purchased from Intracel Corporation (Issaquah, WA), gpl20 and anti-gpl20 
antibody were purchased from DuPont (Wilmington, DE) and o-phenylenediamine 
peroxidase substrate tablet sets were purchased from Sigma Chemical Co. (St. Louis, 
MO). Reactions were carried out in commercially available Beckman 2 ml deep-well 
microtiter plates. 

EXAMPLE I 

Pharmac ophore Recombination for the Identification of Compounds Capable of 
Inhibiting the Interaction Between gnl20 and CD4 
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To demonstrate the principle of pharmacophore recombination, we established 
a biochemical screen for the inhibition of gpl20-CD4 binding. This assay measures 
the ability of small molecules to inhibit the binding of gpl20 to sCD4 that is 
immobilized on a microtiter plate. Binding of sCD4 was quantified with an anti- 

5 gpl20 antibody conjugated to horseradish peroxidase. 

General procedure for the synthesis of an oxime compound library 

For several reasons, we chose to initially employ O-methyl oximes, rather than 
aldehydes, for the initial compound building block library. First, O-methyl oximes 
best model the pharmacophore units in the final oxime coupled dimers. Second, O- 

5 methyl oximes are more soluble in aqueous solution than their more hydrophobic 
aldehyde precursors. Also, the oxime functionality is clearly not inherently toxic and 
does not interfere with good pharmacokinetics or cell permeability since oximes are 
present in many drugs. Finally, the O-methyl oximes are easily prepared in a single 
step condensation of aldehydes with O-methyl hydroxylamine, without requiring 
purification of the resultant product. The chemical condensation of an aldehyde with 
O-methyl hydroxylamine to provide an oxime compound is shown in Figure 1. 

In the first step of the method, the initial oxime library was synthesized by 
separately condensing O-methyl hydroxylamine with 252 different aldehydes in a 
DMSO solution. The oxime library was prepared in a spatially separate fashion in a 
microtiter plate format such that each well contained a single oxime compound. More 
specifically, in each well of a microtiter plate, a DMSO solution of an unique aldehyde 
(0.188 ml, 0.15 M, 0.028 mmol) was added. To this solution, a DMSO solution of 0- 
methyl hydroxylamine (0.083 ml, 0.5 M, 0.042 mmol) was then added followed by 
addition of a DMSO solution of acetic acid (0.023 ml, 0.5 M, 0.01 1 mmol). The plates 
were allowed to sit at room temperature overnight during which time condensation 
occurred, thereby providing the 252 member library of oxime compounds. 
Assay to determine which oxime compounds are capable of inhibiting the 
interaction between gpl20 and CD4 

The 252 member oxime compound library prepared as described above was 
then screened for the presence of compounds capable of inhibiting the interaction 
between gpl20 and CD4 in a standard ELISA assay. For the gpl20-CD4 ELISA 
assay, an ImmuIon-2 microtiter plate was incubated overnight at 4 °C with 70 ng of 
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sCD4 in 100 \xl of carbonate buffer. The solution was removed from the plate and 
washed three t-knes with phosphate buffered saline (PBS) at pH 7.4. The plate was 
blocked with 150 |ll1 PB S -Tween-B S A (0.5% BSA, 0.05% Tween-20) for 1 h at room 
temperature and then washed again. gpl20 (1 ng) in 50 |xl of PBS and 50 (il of test 

5 organic oxime compound (3 mM), 40 \i\ PBS, 10 |xl were added and incubated for 1 h 
at room temperature. The plate was then washed and 100 \x\ of anti-gpl20 conjugated 
horseradish peroxidase was added and incubated for 1 h at room temperature. The 
bound gpl20 was then quantitated with o-phenylenedi amine as a substrate. 

The results of these assays demonstrated that 30 of the 252 oxime compounds 

3 were capable of inhibiting the interaction between gpl20 and CD4, wherein the 
approximate EC 50 values ranged from about 20 fxM to 500 ^M. The structurally 
related aldehyde analogs of 30 of these oximes showed diverse structural motifs 
including chromones, phenols and furans (see Figure 2). 

Cross-linking of the top 30 structurally related aldehyde analogs to produce a 
) library of candidate compounds and screening of those candidate compounds for 
the ability to inhibit the interaction between gpl20 and CD4 

Each of the 30 structurally related aldehydes analogs corresponding to the 30 
oxime compounds identified above as being capable of inhibiting the interaction 
between gpl20 and CD4 were individually coupled to each of the other 29 aldehydes 
with a variety of linkers to produce a library of candidate compounds for binding to 
the target molecule. Each of the 30 individual aldehydes was linked with another 
aldehyde through an 0,0'-diamino-alkanediol linker to obtain the library of cross- 
linked candidate compounds. Each aldehyde combination was kept spatially separate, 
but an equimolar mixture of five different O.O'-diamino-alkanediol linkers were used 
in each coupling reaction to provide a 450-member library of cross-linked candidate 
compounds. The chemistry used for preparation of the compounds is shown in Figure 
3 showing the synthesis of both homodimers and heterodimers. 

The five linkers employed each consisted of two hydroxylamine groups 
tethered to an aliphatic chain having either two, three, four, five or six methylene units. 
This allowed us to evaluate any distance dependency the two pharmacophores may 
have in the binding site. Linkers were prepared as follows. To a round-bottomed flask 
was added alkyl dibromide (20.2 mmol), N-hydroxyphthalimide (36.8 mmol, -1.8 
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equiv) and dimethylformamide (90 ml). The flask was cooled to 0 °C and 1,8- 
diazabicyclo[5.4.0]undec-7-ene (40.5 mmol) was added dropwise with stirring. The 
reaction was allowed to warm to room temperature and was then stirred overnight. 
The reaction mixture was then poured into 1M HC1 (500 ml). The resulting white 
5 solid precipitate was washed with water (3 x 50 ml) and methanol (3 x 50 ml) and sent 
onto the next step without further purification. 

13.9 mmol of the crude bis-Af-alkoxyphthalimide was then added to a round- 
bottomed flask in combination with dimethoxyethylene glycol (150 ml). To another 
flask was added hydrazine monohydrate (41.8 mmol, 3 equiv.) and dimethoxyethylene 
10 glycol (100 ml). The suspension of bis-N-alkoxyphthalimide was added slowly with 
stirring to the hydrazine solution. The flask was refluxed for 3 h, allowed to cool to 
room temperature and the resulting precipitate was filtered away. The remaining 
supernatant solution was concentrated and the resulting yellowish-oil was purified by 
Kugelrohr distillation (0.01 mm Hg, 60-70 °C) and column chromatography (89:9:2 
1 5 CHClj/MeOH/NH.OH). 

For 0,0'-di amino- 1,4-butanediol), the general synthesis procedure described 
above was followed. IR (film from CK^Cl,): 3412.8,3310.0,2942.7,2866.3 cm 1 . ! H 
NMR (400 MHz, CDC1 3 ): 8 5.31 (br s, 4H), 3.65 (m, 4H), 1.57 (m, 4H). ,3 C NMR 
(400 MHz, CDC1 3 ): 8 75.5, 24.8. Anal. Calcd for C 4 H l2 0 2 N 2 : c > 39 -"i H » 10 - 07 » N > 
10 23.32. Found: C, 40.17; H, 9.90; N, 23.12. 

Once the five linkers were obtained, the 450 linked aldehyde combinations 
were prepared as follows. In each well, a DMSO solution of each of two different 
aldehydes (0.045 ml, 0.15 M, 0.007 mmol each) was added. To this solution, a DMSO 
solution of an equimolar mixture of the five linkers (0.025 ml, 0.3 M of each linker, 
0.007 mmol) was added followed by a DMSO solution of acetic acid (0.005 ml, 0.5 M, 
0.003 mmol). The plates were allowed to sit at room temperature overnight to allow 
for potential ligand formation. 

Each of the 450 members of the potential ligand library was then tested at a 
concentration of 100 |xM each (i.e., a concentration that is 10-fold more dilute than the 

0 

concentration employed in the initial oxime monomer screen) for the ability to inhibit 
the interaction between gp!20 and CD4 using the ELISA assay described above. The 
results from these assays demonstrated that more than 300 of the 450 members of the 
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potential ligand library showed greater than 50% inhibition activity at the 
concentration employed. When the 450 members of the potential ligand library were 
tested at a concentration of 1 jiM each for the ability to inhibit the interaction between 
gpl20 and CD4, 17 of the ligands showed greater than 50% inhibitory activity at that 
5 concentration. The chemical structures of 12 of the 17 most active aldehyde 
precursors are shown in Figure 4. 
Evaluation of linker length dependence on activity 

The 17 linked aldehyde combinations with the greatest activity were then 
resynthesized as described above with a unique linker per well (85 separate wells) so 
0 as to evaluate any linker-length dependency on binding. Screening each of the 85 
ligand compounds at 1 jliM each showed that ligands that incorporated pharmacophore 
1 (see Figure 5) were more potent than the other ligands and that there indeed was a 
dependence upon linker length. Specifically, ligands with linkers having either 4 or 5 
methylene units were much more active than ligands with linkers having 6 methylene 
units, whereas ligands with linkers having either 2 or 3 methylene units were only 
slightly more active than ligands with linkers having either 4 or 5 methylene units. 
Biological and Analytical Characterization of Representative Ligands 

One of the ligands that exhibited strong activity for inhibiting the interaction 
between gpl20 and CD4 (as shown in Figure 6) was resynthesized with each of the 
five different linkers on a large scale and was then purified by column 
chromatography. Column chromatography purification enabled isolation of the 
heterodimer from the homodimer. 

Large scale synthesis of organic oxime compounds was performed as follows. 
To a flame-dried round-bottomed flask was added aldehyde (0.82 mmol) and DMSO 
(8 ml). A 0.9 M 0-methyl hydroxylamine (1.4 ml) was then added and the reaction 
mixture was allowed to stir at room temperature overnight: The reaction was poured 
into methylene chloride (50 ml), washed with H^O (3 x 20 ml), dried and concentrated. 
Silica gel chromatography provided the pure organic oxime compounds. 

The oxime compounds made by this method were characterized as follows. 
(1) O-methvl oxime of 6-nitropiperonal 

Reaction of 6-nitropiperonal with 0-methyl hydroxylamine provided 
predominantly one oxime isomer which was purified by silica gel chromatography 



- 80 - 



WO 99/49314 PCT/US99/06734 

(10:90, EtOAc/hexanes). 'H NMR (400 MHz, CDC1 3 ): 5 8.61 (s, 1H), 7.53 (s, 1H), 
7.37 (s, 1H), 6.15 (s, 2H), 4.00 (s, 3H). Anal. Calcd for C 9 H,0 3 N 2 : C, 48.22; H, 3.60; 
N, 12.50. Found: C, 48:40; H, 3.75; N, 12.56. 



5 (2) O-methvl oxime of 6.8-dichloro-3-formvIchromone 

Reaction of 6,8-dichloro-3-formyIchromone provided 1:1 cis/trans isomers 
which were isolated by silica gel chromatography (20:80, CHjCIj/hexanes). 

Isomer 1: 'H NMR (400 MHz, CDC1 3 ): 8 8.48 (s, 1H), 8.24 (s, 1H), 8.12 (d, 1H, J = 
10 2.5), 7.75 (d, 1H, J = 2.5), 3.97 (s, 3H). Anal. Calcd for C I1 H 7 0 3 NC1 2 : C, 48.56; H, 
2.59; N, 5.15. Found: C, 48.44; H, 2.47; N, 5.03. 

Isomer 2: 'H NMR (400 MHz, CDC1 3 ): 5 9.45 (s, 1H), 8.12 (d, 1H, J = 2.5), 7.75 (d, 
2H, J = 2.5), 4.07 (3H). Anal. Calcd for C.^O.NC^: C, 48.56; H, 2.59; N, 5.15. 
Found: C, 48.45; H, 2.49; N, 5.11. 

15 

Large scale synthesis of oxime ligands was performed as follows. To a flame- 
dried round-bottom flask was added 10 ml of DMSO and 1.03 mmol of each of the 
two aldehydes to be incorporated into the ligand. After all solids were dissolved, a 
solution of the appropriate linker (1.24 mmol) in 1 ml of DMSO was added dropwise, 
followed by the addition of acetic acid (0.72 mmol). The reaction mixture was 

20 

allowed to stir at room temperature overnight. The reaction was then poured into 
methylene chloride (50 ml), washed with K/D (3 x 20 ml), dried and concentrated. 
Silica gel chromatography provided the isolated homo/heterodimers. Cis/trans 
isomers, when present, were not separated and were purified as mixtures of isomers. 
The oxime dimers made by this method were characterized below. 

25 

(1) Oxime heterodimer of 6.8-dichloro-3-formvIchromone and 6-nitropiperonaK 
linker containing 4 methylene units 

The heterodimer was separated from the homodimers by silica gel 
chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
^ characterized as a 1:1 mixture of cis/trans isomers. Anal. Calcd for C 22 H l7 O g N 3 Cl 2 : C, 
50.59; H, 3.28; N, 8.05. Found: C, 50.70; H, 3.40; N, 7.89. 
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Isomer 1: 'H NMR (400 MHz, CDC1 3 ): 5 9.42 (s, IH), 8.55 (s, IH), 8.00 (s, 1H), 
7.67 (m, 2H); T.3<r (s, 1H), 7.27 (s, IH), 6.09 (s, 2H), 4.22 (m, 4H), 1.84 (m, 4H). 

Isomer 2: 'H NMR (400 MHz, CDCI 3 ): 8 8.54 (s, IH), 8.41 (s, 1H), 8.16 (s, 1H), 
7.99 (s, IH), 7.67 (s, IH), 7.40 (s, IH), 7.24 (s, IH), 6.08 (s, 2H), 4.22 (m, 4H), 1.83 
(m, 4H). 

(2) Oxime heterodimer of 6.8-dichloro- 3-formvlchrnmone and 6-nitropiperonaI. 
linker containine 5 methylene units 

The heterodimer was separated from the homodimers by silica gel 
chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
characterized as a 1.5:1 mixture of cis/trans isomers. Anal. Calcd for C^H^N^: 
C, 51.51; H, 3.57; N, 7.83. Found: C, 51.68; H, 3.70; N, 7.66. 

Isomer 1: 'H NMR (400 MHz, CDC1 3 ): 5 8.62 (s, IH), 8.47 (s, IH), 8.24 (s, IH), 8.10 
(s, IH), 7.74 (s, IH), 7.73 (s, IH), 7.37 (s, IH), 6.14 (s, 2H), 4.20 (m, 4H), 1.8 (m, 
4H), 1.53 (m, 2H). 

Isomer 2: 'H NMR (400 MHz, CDC1 3 ): 6 9.47 (s, IH), 8.61 (s, IH), 8.1 1 (s, IH), 7.75 
(s, 2H), 7.48 (s, IH), 7.33 (s, IH), 6.14 (s, 2H), 4.30 (t, 2H, J = 6.6), 4.20 (m, 2H), 
1.80 (m,4H), 1.53 (m, 2H). 

These purified heterodimers and homodimers were then tested as described 
above for the ability to inhibit the interaction between gpl20 and CD4. The results of 
these assays demonstrated that heterodimers shown in Figure 6 having a linker 
containing from 2 to 5 methylene units exhibited EC 50 's ranging from 0.6 to 1.5 nM 
and showing 10- to 20-fold enhancement in inhibitory activity over the compound 
shown in Figure 5 (EC 50 in the range of about 10-15 U.M). The other compound that 
was incorporated into the heterodimer had an EC 30 of greater than 50 fiM. 

The heterodimers shown in Figure 6 having linkers containing from 2 to 5 
methylene units are of comparable potency to the most potent compounds that have 
been identified to date that block the CD4/gpl20 interaction (Tanaka et al., J. 
Antibiotics 50:58 (1997), Sun et al., J. Antibiotics 49:689 (1997), Jarvest et al., Bio. 
Med. Chem. Lett. 3:2851 (1993) and Chen et al., Proc. Natl. Acad. Sci. USA 89:5872 
(1992)). In addition, these ligand heterodimers are considerably less complex than 
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previously identified compounds with comparable activity. Further optimization of 
the optimal building block and linker combinations could presumably be accomplished 
by evaluating a larger range of linkers with enhanced rigidity or by incorporating 
analogs of the optimal aldehyde precursors. 
5 EXAMPLE 2 

Pharmacophore Recombination Using N,N-Diiriethvlamines and Other Diamine 

Linkers 

In addition to the use of aldehydes and oximes for the pharmacophore 
recombination method as described above, additional chemistries also find use. In this 
10 example, the organic compound building blocks are N,N-dimethylamine compounds 
that are prepared by reductive amination of starting aldehydes and dimethylamine 
using support-bound triacetoxyborohydride (Kaldor et al., Tetrahedron Lett. 37:7193- 
7196 (1996)). The chemistry of these reactions is shown in Figure 7. Removal of the 
support-bound reducing agent by filtration followed by concentration to remove the 

1 5 volatile, excess dimethylamine then provides the pure N,N-dimethylamine monomer 
building blocks. Alternatively, the N,N-dimethylamine building blocks may be 
obtained by reduction using a sodium borohydride-based reducing agent in solution. 
The resulting amine product is then isolated from the excess reducing agent or 
aldehyde by passing down an acidic ion exchange column. The amine product is then 

20 obtained by elution from the ion exchange column with a volatile amine such as 
ammonia followed by concentration. 

Linkage of the N,N-dimethylamine building blocks can be accomplished 
through the use of diamine linkers of which many are commercially available and 
many more can be readily prepared using well known methodology. The commercial 

25 availability of the diamine linkers allows rapid optimization of linker length, rigidity 
and orientation. An exemplary synthesis sequence is shown in Figure 8. Specifically, 
support-bound chloride (3) (or other support-bound halide) is treated with excess of a 
diamine to provide an amine-derivatized support (4). Acylation of the amine 
functionality then provides support-bound formamide or carbamate (5). Reduction 

™ then provides support-bound secondary amine (6). Reductive amination then 
introduces one of the pharmacophore elements (7). Acid treatment then releases a 
secondary amine from the support, which can then be treated with the second 
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pharmacophore monomer and sodium triacetoxyborohydride to provide the desired 
pharmacophore heterodimer (either support-bound reagent or alternative scavenging 
methods may be employed). Initial attachment of the diamine to the support can be 
accomplished using other support-bound alkyl halides or could be accomplished by 
5 reductive amination of a support-bound aldehyde or ketone. Fewer linkers are 
available that contain two secondary amine groups, but these can also be incorporated, 
in this case, the acylation step (4 to 5 in Figure 8) and the subsequent reduction step (5 
to 6 in Figure 8) would be eliminated. 

The foregoing description details specific methods which can be employed to 

10 practice the present invention.- Having detailed such specific methods, those skilled in 
the art will well enough know how to devise alternative reliable methods at arriving at 
the same information in using the fruits of the present invention. Thus, however, 
detailed the foregoing may appear in text, it should not be construed as limiting the 
overall scope thereof; rather, the ambit of the present invention is to be determined 

1 5 only by the lawful construction of the appended claims. All documents cited herein 
are expressly incorporated by reference. 



20 



25 



30 



- 84 - 



WO 99/49314 



PCT/US99/06734 



WHAT IS CLAIMED IS : 

1. A method for identifying a drug lead compound that binds to a target 
biological molecule (TBM) of interest, the method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF) 
5 capable of being chemically cross-linked by a cross-linker to provide candidate cross- 
linked target binding fragments for binding to the target biological molecule; 

(b) screening the library of candidate target binding fragments to identify at 
least first and second candidate target binding fragments that bind to the target 
biological molecule; 

10 ( C ) chemically cross-linking the at least first and second candidate target 

binding fragments or structurally related analogs thereof with a cross-linker to provide 
a library of candidate cross-linked target binding fragments for binding to the target 
biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound 

1 5 that binds to the target biological molecule. 

2. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to 
the target biological molecule with a K d of from about 5 mM to about 0.05 mM. 

20 

3. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to 
the target biological molecule with a K d of from about 3 mM to about 0.1 mM. 

25 4. The method according to Claim 1, wherein the drug lead compound 

identified in step (d) binds to the target biological molecule with a K d of 500 nM or 
lower. 

5. The method according to Claim 1, wherein the screening steps (b) and 
10 (c) consist essentially of an in vitro biological assay. 
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6. The method according to Claim 1, wherein the library of candidate 
cross-linked target binding fragments for binding to the target biological molecule 
comprises homodimeric or heterodimeric candidate cross-linked target binding 
fragments. 

5 

7. The method according to Claim 1, wherein the library of candidate 
target binding fragments comprises candidate target binding fragments of less than 
about 500 daltons. 

10 8. The method according to Claim 1 wherein the library of candidate 

cross-linked target binding fragments comprises candidate cross-linked target binding 
fragments of less than about 750 daltons. 

9. The method according to Claim 1, wherein the target biological 
1 5 molecule is a human or human pathogen protein. 

10. The method according to Claim 9, wherein the protein is an enzyme, a 
human hormone, a human receptor and fragments thereof having nitrogen's in the 
protein present in their naturally occurring isotopic abundance. 

20 

11. The method according to Claim 1, wherein at least one of the screening 
steps (b) and (d) is accomplished by ELISA assay. 

12. A method for inhibiting the binding of a first biological molecule to a 
25 second biological molecule that binds to the first biological molecule, the method 

comprising: 

contacting a system comprising both the first and second biological molecules 
with a binding inhibitory amount of a drug lead compound identified according to the 
method of Claim 1, wherein the drug lead compound binds to the first biological 
■ 10 molecule and inhibits its ability to bind to the second biological molecule. 
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13. The method according to Claim 12, wherein the first and second 
biological molecules are human proteins. 

14. The method according to Claim 13, wherein the first and second 
5 biological molecules are selected from the group; a human hormone, cytokine or 

chemokine, a human receptor and fragments thereof 

15. A method for identifying a drug lead compound that binds to a target 
biological molecule of interest, the method comprising: 

10 (a) assenbling a library of candidate target binding fragments, each 

fragment containing an oxime linking group; 

(b) screening the library of candidate target binding fragments or 
monomers to identify at least first and second oxime containing candidate target 
binding fragments that bind to the target biological molecule; 

1 5 ( c ) chemically crosslinking the aldehyde analogs of the at least first and 

second oxime containing candidate target binding fragments with an 0,0'-diarnino- 
alkanediol cross-linker to provide a library of oxime containing candidate cross-linked 
target binding fragments for binding to the target biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound 
20 that binds to the target biological molecule. 

16. A method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF), 
each fragment having a linkable functional group (LFG) or blocked form thereof 

25 (BLFG), the blocked form containing a linking group (LG); 

(b) contacting the candidate target binding fragments with a target 
biological molecule (TBM); 

(c) measuring a change in a first physical association (PA-1) of the target 
biological molecule; 

™ (d) selecting target binding fragments (TBF) based on (c); 

(e) reacting selected target binding fragments having a linkable functional 
group with a cross-linker, having chemically compatible cross-reactive groups (CFG) 
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with the LFG, under conditions suitable for forming a library of candidate cross-linked 
target binding-fragments (CXL-TBF); 

(f) contacting the candidate cross-linked target binding fragments with the 
target biological molecule (TBM); 
5 (g) measuring a change in a second physical association (PA-2) of the 

target biological molecule; 

(h) selecting cross-linked target binding fragments (XL-TBF) based on (g). 

17. The method of Claim 16 wherein the candidate target binding fragment 
10 contacted with the TBM contains a blocked linkage functional group (BLFG) 

containing linking group LG. 

18. The method of Claim 17 where in the linking group (LG) in BLFG is 
selected from the group; oxime, hydrazone, N-acyl hydrazone, secondary amine, 

15 tertiary amine, acetal, ketal, 1,2 amino alcohols, amide, N,N-disubstituted amides, 
thioamide, ureido, thioureido, carbamate, thiocarbamate, thiothiocarbamate, 
sulfonamide, carbonate, guanidino, amidino, thioester, ester, ether, 2-hydroxyether, 2- 
hydroxythioether, thioether, disulfide, alkane (alkylene), alkene (alkenylene) and 
alkyne (alkynylene). 

20 

19. The method of Claim 18 wherein each candidate target binding 
fragment (CTBF) of step (b) contains the same linking group (LG) as is present in the 
candidate cross-linked target binding fragment (CXL-TBF) of step (f). 

25 20. The method of Claim 19 wherein the linking group (LG) is selected 

from the group; oxime, secondary amine, tertiary amine, amide, ureido, thioureido, 
sulfonamide and carbamate. 

21. The method of Claim 19 wherein two candidate target binding 
10 fragments (CTBF) selected from step (d) are cross-linked to form a candidate cross- 
linked target binding fragment (CXL-TBF) represented by 
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TBF m | — LGh— 1 XL | LG-2 — | TBF n 



10 



15 



where 

TBF m is a first TBF selected from step (d) which contained LG, in its blocked 
5 linking group ; 

TBF n is a second TBF selected from step (d) which contained LG 2 in its 
blocked linking group; 

XL represents the cross-linker without the chemically compatible cross- 
reactive functional groups; 

LG, represents the linking group in the first TBF; and 
LG 2 represents the linking group in the second TBF. 

22. The method of Claim 21 wherein LG, and LG 2 are the same or are 
different and are selected from the group oxime, secondary amine, tertiary amine, 
amide, ureido, thioureido, sulfonamide and carbamate. 

23. The method of Claim 21 wherein TBF m and TBF o are the same or are 
different. 



23. The method of Claim 21 where XL is selected from the group of 
alkanes: methylene, ethylene, propylene, butylene, pehtylene, hexylene and 
heptylene,optionally containing 0, 1, 2 or 3 ether linkages and from 1-3 double bonds 
and aryls: ortho-, meta- or para- C 0 -C 6 -alkyl-phenyl-C 0 -C 6 -alkylene . 

24. The method of Claim 16 wherein the candidate cross-linked target 
binding fragments are represented by the formula: 



TBF m | LGq— | XL I LG?— I TBF n 



30 where 



TBF m represents a first TBF selected from step (d); 
TBF (| represents a second TBF selected from step (d); 
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XL represents the cross-linker without the chemically compatible cross- 
reactive functional -groups selected from the group 
C 0 -C l0 -aIkylene, 

C 0 -C 6 -alkyl-C 6 -C to -aryl-C 0 -C 6 -alkylene, 
CpQ-alkyl-^R^-C.-C^alkylene, 

(C r C 6 -aIkyl-0-C r C 6 -aIkylene) n , where n=l, 2, 3 or 4; 
LG, and LG 2 are linking groups independently selected from the group 
-C(R a )=N-0-, -0-N=C(R>, -CH^NCR.)-, -NfRJ-CH,-, 
-C(=0)-N(R>, -N(R a )-C(=0)-, -N(R.)-C(=0)-0-, -0-C(=0)-N(R>, 
-N(R a )-C(=O^N(R b )-, -N(R>C(==0)-N(R b )-, -SO r N(R> and 
-N(R a )-S0 2 -; 

R a and R b are independently selected from the group 

hydrogen, C r C l0 -alkyl, C 0 -C 10 -alkyl-C 6 -C 10 -aryl, C 6 -C l0 -aryl-C 0 -C 10 - 
alkyl, C 0 -C I0 -alkyl-heterocycle-C 0 -C l0 -alkyl, C r C 6 -alkyl-NH-C r C 6 - 
alkyl, C 0 -C 10 -alkyl-O-C 0 -C 10 -alkyl, C 0 -C l0 alkyl-C(=O)-C 0 -C I0 -alkyl, C 0 - 
C f0 -alkyl->m-C(=O)^^^^ alkyl 
where any alkyl, aryl or heterocycle is optionally substituted with C r C I0 - 
alkyl, C r C I0 -alkoxy, C 6 -C 10 -aryl, C 6 -C I0 -aryloxy, halo (F, CI, Br, I), 

hydroxy, carboxy, amino, nitro and S(O) 03 . 

25. The method of Claim 24 wherein the TBF m and TBF,, from step (d) 
each independently bind to the target biological molecule with a K d of from about 3 
mM to about 100 fxM. 

26. The method of Claim 25 wherein the TBF m and TBF 0 from step (d) 
each independently bind to the target biological molecule with a K d of from about 2 
mM to about 500 jxM. 

27. The method according to Claim 24, wherein the target biological 
molecule is a human or human pathogen protein. 
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28. The method according to Claim 24, wherein the protein is an enzyme, a 
human hormone or a human receptor having nitrogen's in their naturally occuring 
isotopic abundance. 

5 29. The method according to Claim 24, wherein at least one of steps (b) and 

(d) is accomplished by ELISA assay. 

30. The method of Claim 21 wherein 



10 



TBF m | L&f— | XL | LG^- 
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is selected from the group; 
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r t 

Rl R 2 



and 



31. The method of Claim 16 wherein the candidate cross-linked target 
binding fragments are represented by the formulae: 



10 



TBF„ 



— LGt— | XL | LGs— [ 



TBF n 



and 



15 



20 



30 



TBF m -part A | | TBF n -part C~ 



LQj— | XL f — LG, 



TBF m -part B 



TBF n -part . D 



where 

TBF m represents a first TBF selected from step (d); 
TBF n represents a second TBF selected from step (d); 

TBF m -part A and B represent TBF m from step (d) where each fragment is 
bonded to a single atom in LG 3 ; 

TBF n -part C and D represent TBF n from step (d) where each fragment is bonded 
to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
25 -(C 0 -C 2 -alkyl.L , -L 2 -L 3 -L 4 -L 5 -C 0 -C 2 -alkyl)-; 

LG, and LG 2 are linking groups independently selected from the group 
-C(R>N-0-, -0-N=C(R,)-, -CH^R.)-, -NCRJ-CH,-, -C(=0)-N(R a )-, -N(R>C(=0)-,. 
-N(R>C(=0)-0-, -0-C(=0)-N(R a H -N(R a )-C(=0)-N(R b )-, -N(R>C(=0)-N(R b )-, - 
SO r N(R )- and -N(R>SO r ; 

LG y and LG 4 are linking groups independently selected from the group 
>C=N-0-, -0-N=C<, -CH,-N<, >N-CH,- t -C(=0)-N<, >N-C(=0)-, >N-C(=0)-0, -O- 
C(=0)-N<, >N-C(=0)-N(R b )-, -N(R>C(=0)-N<, -SO r N< and >N-SO r , where < and 
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> represent two bonds linking TBF-part A, B, C, or D to the single N or C atom in LG 3 
or LG 4 ; 

R a and R b are independently selected from the group 
hydrogen, C r C I0 -aIkyl, C 0 -C I0 -alkyl-C 6 -C l0 -aryl, C 6 -C l0 -aryl-C 0 -C I0 -aIkyl, C 0 -C l0 -aIkyl- 
heterocycle-C 0 -C 10 -alkyl, C r C 6 -alkyl-NH-C r C 6 -alkyl, C 0 -C I0 -alkyl-O-C 0 -C 10 -alkyl, C 0 - 
C l0 alkyl-C(=O)-C 0 -C I0 -alkyl, C 0 -C I0 -alkyl-NH-C(=O)-C 0 -C I0 -alkyl, C 0 -C 10 -alkyl-O- 
C(=O)-C 0 -C I0 -alkyl, where any alkyl, aryl or heterocycle is optionally substituted 
with C r C 10 -alkyl, C,-C l0 -alkoxy, C 6 -C I0 -aryl, C 6 -C 10 -aryloxy, halo (F, CI. Br, I), 
hydroxy, carboxy, amino, nitro and S(0)^ 3 ; 

TBF m , TBF n , TBF m -part A, TBF in -part B, TBF n -part C and TBF n -part D are 
each independently represented by formula I 



-A-(Cycle l)-B-(Cycle 2)-E (I) 

Where 

Cycle 1 and Cycle 2 are independently present or absent and are selected from 
a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 
6 or 7 atoms in the ring where the ring atoms are carbon or from 1-4 heteroatoms 
selected from; nitrogen, oxygen, and sulfur, and where any sulfur ring atom may 

optionally be oxidized and any carbon ring atom may form a double bond with O, NR n 
11'. 

and CR R , each ring nitrogen may be substituted with R n and any ring carbon may 
be substituted with R d ; 

A and B are independently selected from 

L 3 — L 2 L 1 



L 3 L 2 — L 1 - 



and 

L*— L 4 L 3 L 2 — L 1 

where: 



L 1 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 1 R 1 '), CXRV), C(R l ), C, het, N(R n ) or N; 
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L 2 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 2 R 2 '), C(R 2 R 2 '), C(R 2 ), C, het, N(R n ) or N; 

L 3 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

5 C(=CR 3 R 3 '), C(R 3 R 3 ') C(R 3 ), C, het, N(R n ) or N; 

L 4 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 4 R 4 '), C(R 4 R 4 '), C(R 4 ), C, NR n or N; and 

L 5 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 
1 0 s 

C(R 5 R 5 \ C(=CR 5 R 5 '), C(R 5 ), C, NR n or N; 

R 1 , R 1 ', R 2 , R 2 \ R 3 , R 3 ', R 4 , R 4 \ R 5 and R 5 each are independently 

selected from R a , R a \ R° and U-Q-V-W; where s is 0-2 

15 Optionally, each R*-R 5 or NR n together with any other R*-R 5 or NR n may 

form a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being 
a homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring 
containing 1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur 



20 



25 



30 



atom may optionally be oxidized, each ring nitrogen optionally substituted with R n 
and each ring carbon optionally substituted with R d ; 
Eis -L'-L 2 -L 3 -R a ; 

R a is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
Cj-C^alkyl, halo(F, CI, Br, IK^-C^lkoxy, hydroxy-C^C^lkyl, cyano, 

isocyanate, carboxy-C^Cj jalkyi, amino, Cq-Cj ^lkyl-amino-^-Cgalkyl), C Q - 

C^alkyl-amino-di-CCj-Cgalkyl), aminocarbonyl, Cj-C^alkylcarbonylamino, 

carboxamido, carbamoyl, carbamoyloxy, formyl, formyloxy, azido, nitro, hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, Cj-Cgalkoxy, 

mercapto, sulfonamido, het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyl and indolyl; 
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R a ' is selected from the group of C 0 -C 1() alkyl-Q-C 0 -C 6 alkyl, C 0 -C 1() alkenyl- 
Q-C 0 -C 6 alkyl, C 0 -C 1() alkynyl-Q-C 0 -C 6 alkyl, Cj-Cj ^ycloalkyl-Q-Cg-Cgatkyl, 
C 10 cycloalkeny]-Q-C 0 -C 6 alkyl, C^C^Ikyl-Cg-C^aryl-Q-C^Cgalkyl, C^C^aryl- 
5 Cj-C^alkyl-Q-CQ-Cgalkyl, Chalky l-het-Q-Cg-Cgalky I, Cg-C^lkyl-Q-het-Cg- 
C 6 alkyl, het-C 0 -C 6 alkyl-Q-C 0 -C 6 alkyl, Cg-Cgalkyl-Q-Cg-C^aryl and Q-C^alky, 

where any aryl or het is optionally substituted with 1-3 R d and any alkyl, alkenyl or 
alkynyl is optionally substituted with 1-3 R a ; 

JO 

R a and R a may join to form a 3-7 member homocyclic ring substituted with 1- 

3R a ; 

RC is selected from hydrogen and substituted or unsubstituted; amino, O-Cj- 
C g alkyl, amino-(C 1 -C g alkyl), amino-dHCj-Cgalkyl), Cj-C^alkyl, C 2 -C 1() alkenyl, 
C 2 -C 1() alkynyl, C 3 -C 11 cycloalkyl, C 3 -C 10 cycloalkenyl, Cj-C^lkyl-Cg-C^aryl, 
C 6" C 10 aryl " C r C 6 alky1 ' c i- c 6 alk y l - hct > het-Cj-Cgalkyl, Cg-C^aryl and het, where 
the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any 
20 aryl or het are 1-3 R d ; 

R d is selected from R h and R p ; 

R h is selected from the group OH, OCF 3 , OR c , SR m , halo(F, CI. Br, I), CN, 
isocyanate, N0 2 , CF y C 0 -C 6 alkyl-NR n R n ', C 0 -C 6 alkyl-C(=O)-NR n R n ', C Q - 

25 

C 6 alkyl-C(=0)-R a , Cj-Cgalkyl, Cj-Cgalkoxy, C 2 -C g alkenyl, C 2 -C g alkynyl, 

Cgcycloalkyl, C 3 -C 6 cycloalkenyl, Cj-Cgalkyl-phenyl, phenyl-Cj-CgalkyI, 

C 6 alkyloxycarbonyl, phenyl-C 0 -C 6 alkyloxy, Cj-Cgalkyl-het, het-Cj-Cgalkyl, S0 2 - 

30 het, 0-C 6 -C 12 aryl, SO^Cg-C^aryl, SO^C j -C^alkyl and het, where any alkyl, 

alkenyl or alkynyl may optionally be sustituted with 1-3 groups selected from OH, 
halo(F, CI, Br, I), nitro, amino and aminocarbonyl, where the substituents on any aryl 
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or het are 1-2 hydroxy, halo(F, CI, Br, I), CF 3 , Cj-C^alkyl, Cj-Cgalkoxy, nitro and 
amino; 

R m is selected from hydrogen, S-Cj-Cgalkyl, C(=0)-C 1 -C 6 alkyl, C(=0)- 

5 NR n R n ', C r C 6 alkyl, haIo(F, CI, Br, I)-C r C 6 alkyl, benzyl and phenyl; 

R n is selected from the group R c , OH, OCF 3 , OR°, CN, isocyanate, NH- 

C(=0)-0-R c , NH-C(=0)-R c , NH-C(=0)-NHR c , NH-S0 2 -R s , NH-S0 2 -NH-C(=0)- 

, 0 R c , NH-C(=0)-NH-S0 2 -R s , C(=0)-0-R°, C(=0)-R c , C(=0)-NHR c , C(=0)-NH- 

C(=0)-0-R°, C(=0)-NH-C(=0)-R c , C(=0)-NH-S0 2 -R s , C(=0)-NH-S0 2 -NHR c , 

S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2 , S0 2 -NH-C(=0)-0-R°, S0 2 -NH-C(=0)-0-R° and 

S0 o -NH-C(=0)-R c ; 
15 2 

R° is selected from hydrogen and substituted or unsubstituted C,-C,alkyl, C~- 

1.6 0 

C 6 alkyl-C 6 -C 1() aryl, Cj-Cgalkylcarbonyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 - 
C g cycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

20 

substituents on any aryl are 1-3 R^; 

R p is selected from the group; OH, halo(F, CI. Br, I), CN, isocyanate, OR 0 , 
SR m , SOR°, N0 2 , CF 3 , R c , NR n R n ', N(R n )-C(=0)-0-R°, N(R n )-C(=0)-R C , C Q - 

25 C 6 alkyl-S0 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', C(=0)-R c , 0-C(=0)-R c , C(=0)-0-R° 

and C(=0)-NR R , where the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a 
and the substituents on any aryl or het are 1-3 R d ; 

R s is a substituted or unsubstituted group selected from; C^Cgalkyl, Cy 
C g alkenyl, C 2 -C g alkynyl, C 3 -C g cycIoalkyl, C 3 -C 6 cycloalkenyl, C 0 -C 6 alkyl-phenyl, 
phenyl-C 0 -C 6 alkyl, C 0 -C 6 alkyl-hel and het-C 0 -C 6 a]kyl, where the substituits on any 
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alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-membered ring containing from one to four 
heteroatoms selected from the group nitrogen, oxygen, and sulfur, the 5-membered 
5 ring having from 0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 
3 double bonds and where any carbon or sulfur atoms in the ring may optionally be 
oxidized, and where any nitrogen heteroatom may optionally be quaternized and where 
any ring may contain from 0-3 R d ; 

U is an optionally substituted bivalent radical selected from the group; Cj- 
C 6 alkyl, C 0 -C 6 alkyl-Q, C^alkenyl-Q, and C 2 -C 6 alkynyl-Q, where the substituits 
on any alkyl, alkenyl or alkynyl are 1-3 R a ; 

Q is absent or is selected from the group; -O-, -S(O) -, -SO.-N(R n )-, -N(R n )- f 

S 2* 

-N(R n )-C(=0)-, -N(R n )-C(=0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C(=0)-0-, -het-,-C(=0)- 

N(R n )-, -PO(OR c )0- and -P(0)0-, where s is 0-2 and the heterocyclic ringis 
substituted with 0-3 R h ; 

V is absent or is an optionally substituted bivalent group selected from Cj- 
C 6 alkyl, C 3 -C g cycloalkyl, C 0 -C 6 alkyl-C 6 -C 1() aryl, and C^alky-hel, where the 

substituits on any alkyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n ', -NH-C(=0)- 
25 O-R 0 , -NH-C(=0)-NR n R n \ -NH-C(=0)-R c , -NH-S0 2 -R s , -NH-S0 2 -NR n R n ', -NH- 

S0 2 -NH-C(=0)-R c , -NH-C(=0)-NH-S0 2 -R s , -C(=0)-NH-C(=0)-0-R°, -C(=0)-NH- 

C(=0)-R c , -C(=0)-NH-C(=0)-NR n R n ', -C(=O)-NH-SO 0 -R s , -C(=0)-NH-SO - 

NR n R n ', -C(=S)-NR n R n \ -S0 2 -R s . -S0 2 -0-R°, -S0 2 -NR n R n ', -SC> 2 -NH-C(=0)-0- 

R°. -S0 2 -NH-C(=0)-NR n R n ', -S0 2 -NH-C(=0)-R c . -0-C(=0)-NR n R n ', -0-C(=0)- 



20 



10 
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R c , -0-C(=0)-NH-C(=0)-R c , -0-C(=0)-NH-S0 2 -R s and -0-S0 2 -R s ; 

Optionally, TBF ro -part A together with TBF^part B and TBF„-part C together 
with TBF„-part D may independently form (Cycle 1) substituted with -B-(Cycle 2)-E.. 

5 

31. A compound made by the method of Claim 30. 
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FIGURE 5 
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FIGURE 9 

TARGET BINDING FRAGMENT ASSEMBLY 



Screen Library of Candidate Target Binding Fragments 

High Solubility, Linkable Functional Group, MW ~ 200da 



Select Winning Fragments by Physical Measurement 

(e.g. ELISA) 

(Kd-nominal: 5mM-100u.M) 



Cross-link Combinations of Winning Fragments 

(MW ~ 500da) 



Screen Winning Cross-linked Fragments 

(Kd < 0.5 \xM) 
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